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Introduction.  The  objective  of  our  work  is  the  high  throughput  synthesis  of  DNA  minor  groove  binding  agents 
based  on  the  distamycin  structure  and  their  screening  for  inhibition  of  androgen  receptor  mediated  gene 
transcription,  which  is  unregulated  in  chemotherapeutic  resistant  prostate  cancer  and  responsible  for  disease 
progression.  It  represents  a  novel  and  unique  new  target  for  the  treatment  of  relapse  prostate  cancer  where 
prognosis  is  presently  very  poor. 

Body.  The  results  to  date  in  our  studies  are  presented  below  and  refer  to  Task  1-Task  4  of  our  proposals’ 
statement  of  work. 

Task  1 :  Prepare  combinatorial  libraries  of  minor  groove  DNA  binding  compounds  related  to  distamycin. 

Year  3:  No  new  efforts  on  the  library  synthesis  were  conducted  in  year  3.  The  efforts  were  completed  in  years  1 
and  2. 

Cumulative  accomplishments:  Two  libraries  of  DNA  binding  compounds  were  prepared  enlisting  the  protocols 
and  structures  outlined  in  the  year  1  and  year  2  progress  reports.  This  provided  2,640  compounds  (library  1)  and 
4,394  compounds  (library  2)  for  examination.  (One  aspect  of  this  work  was  disclosed  in  publication  1  and  the 
preparation  of  the  first  library  was  summarized  in  publications  2  and  3). 

Task  2:  Screen  the  libraries  for  inhibiting  activity  in  a  prostate  cancer  cell  line,  a  LI  210  cell  line,  and  for  those 
that  have  high  affinity  binding  to  ARE’s  (androgen  response  element). 

Year  3:  The  completion  of  the  screening  of  the  library  2  was  accomplished  in  year  3.  The  library  1  screening 
was  completed  in  year  2. 

Cumulative  accomplishments:  The  two  libraries  composed  of  7,034  compounds  were  screened  for  DNA 
binding  to  two  ARE’s  consensus  sequences  (specific  DNA  sequences)  and  for  growth  inhibiting  activity  against 
a  prostate  cancer  cell  line  and  the  L1210  cancer  cell  line.  Notably,  the  DNA  binding  screening  enlisted  a  new 
technique  that  we  introduced  (a  fluorescent  intercalator  displacement  (FID)  assay)  which  permits  the  high 
throughput  screening  for  DNA  affinity  and  selectivity.2  This  required  that  we  screen  the  library  of  compounds 
(7,034  compounds)  against  two  hairpin  dioxyoligonucleotides,  each  containing  an  ARE  consensus  sequence, 
using  the  FID  assay  to  establish  DNA  binding  affinity.  (A  seminal  review  of  our  introduction  and  development 
of  the  FID  assay  is  provided  in  publication  5). 

Task  3:  Define  DNA  binding  selectivity  of  identified  leads. 

Year  3:  A  total  of  twelve  compounds  that  emerged  from  the  above  screening  efforts  were  examined  for  their 
DNA  binding  selectivity. 

Cumulative  accomplishments:  The  DNA  binding  selectivity  of  the  lead  structures  (12  compounds)  that  emerged 
from  the  Task  2  screening  was  defined.  This  entailed  the  use  of  the  FID  assay,  this  time  to  screen  each  of  the  12 
compounds  against  a  full  library  of  512  hairpin  deoxyoligonucleotides.2  Each  hairpin  oligo  contains  a  different 
DNA  sequence  representing  one  of  all  possible  5  base  pair  sites.2  The  screening  results  provide  a  high 
resolution  and  comprehensive  definition  of  each  compounds  DNA  binding  selectivity  including  an  assessment 
of  selectivity  for  a  sequence  in  the  ARE’s  over  all  other  possible  sites.  (A  seminal  review  of  our  introduction 
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and  development  of  the  FID  assay  is  provided  in  publication  5  and  the  behavior  of  our  best  lead  compound, 
128,  is  summarized  in  publication  3). 

Task  4:  Determine  the  identified  compounds  effect  on  AR-mediated  gene  transcription  and  transactivation. 

Year  3:  The  twelve  compounds  were  examined  in  the  assay  we  detailed  in  the  original  proposal. 

Cumulative:  The  most  effective  compound  identified  is  a  compound  we  refer  to  as  128.  We  found  that  128 
potently  (IC50  =  8  nM)  and  selectively  ( ca .  100-fold)  inhibits  androgen  receptor  mediated  gene  transcription  in 
the  assay  we  detailed  in  the  proposal,  but  only  if  it  is  delivered  artificially  into  the  cell  using  liposomes.2c,3b 
(This  work  has  been  disclosed  in  publications  3  and  5).  Thus,  the  strategy  outlined  does  identify  candidate 
compounds  useful  for  potential  treatment  of  therapeutic  resistant  prostate  cancer.  However,  we  have  not  yet 
identified  derivatives  that  enter  cells  better  than  128  or  inhibit  androgen  receptor  mediated  gene  transcription  (in 
a  cellular  functional  assay)  in  the  absence  of  liposomal  delivery. 
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We  also  examined  the  feasibility  of  (1)  altering  the  terminal  amine  pK&  and  (2)  removing  this  charged 
group  (which  inhibits  cell  penetration)  and  attaching  a  novel  reversible  alkylating  agent  capable  of  increasing 
(replacing)  the  lost  DNA  binding  affinity  that  results  from  removing  the  charged  group.4  In  these  studies,  we 
did  not  identify  such  derivatives  among  the  prototypical  compounds  prepared  (e.g.  1).  That  is,  altering  the 
terminal  amine  reducing  the  pKa  simply  reduced  DNA  binding  affinity  without  improving  cell  penetration  and 
the  attachment  of  such  a  reversible  alkylating  group  did  not  increase  DNA  binding  affinity  (i.e.,  no  apparent 
alkylation).  (This  latter  work  was  disclosed  in  publication  4). 
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Key  Research  Accomplishments. 

•  Development  and  implementation  of  a  novel  solution-phase  approach  to  the  preparation  of  libraries  of 
sequence  selective  DNA  binding  compounds  (high  throughput  synthesis).1 

•  Development  of  a  novel  high  throughput  screen  for  establishing  DNA  binding  selectivity  or  affinity. 
This  includes  the  introduction  of  the  first  high  throughput  screen  for  a  defined  DNA  sequence  (i.e., 
androgen  response  element)  that  can  control  aberrant  gene  transcription.2  We  recently  reviewed  our 
efforts  on  the  development  of  this  screen  and  highlighted  its  application  to  inhibition  of  androgen 
receptor  mediated  gene  transcription  (publications  3  and  5).2c,3b 

•  Definition  of  previously  unexamined  features  of  distamycin  responsible  for  its  DNA  binding  affinity 
(publication  1). 

•  Highlighted  the  feasibility  of  discovering  inhibitors  of  androgen  mediated  gene  transcription  with 
sequence  selective  minor  groove  DNA  binding  compounds,  albeit  presently  requiring  liposomal  delivery 
of  the  compounds  into  a  cell  (publications  2,  3  and  5). 
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Reportable  Outcomes. 


Publications. 

1.  C.  R.  Woods,  N.  Faucher,  B.  Eschgfaller,  K.  W.  Bair,  and  D.  L.  Boger,  Synthesis  and  DNA  binding 
properties  of  saturated  distamycin  analogues,  Bioorg.  Med.  Chem.  Lett.  2002, 12,  2647-2650. 

2.  D.  L.  Boger,  Solution-phase  synthesis  of  combinatorial  libraries  designed  to  modulate  protein-protein  or 
protein-DNA  interactions,  Bioorg.  Med.  Chem.  2003,  11,  1607-1613. 

3.  D.  L.  Boger,  J.  Desharnais,  and  K.  Capps,  Solution-phase  combinatorial  libraries:  modulating  cellular 
signaling  by  targeting  protein-protein  or  protein-DNA  interactions,  Angew.  Chem.  Int.  Ed.  2003,  42,  4 1 38— 
4176. 

4.  B.  K.  S.  Yeung  and  D.  L.  Boger,  Synthesis  of  isochrysohermidin-distamycin  hybrids,  J.  Org.  Chem.  2003, 
68,  5249-5253. 

5.  W.  C.  Tse  and  D.  L.  Boger,  A  fluorescent  intercalator  displacement  (FID)  assay  for  establishing  DNA 
binding  selectivity  and  affinity,  Acc.  Chem.  Res.  2004,  37,  61-69. 

Conclusions.  The  work  progressed  well.  The  importance  of  the  work  includes  not  pnly  the  potential 
development  of  a  treatment  for  relapse  (resistant)  prostate  cancer,  but  it  defines  a  new  approach  to  treating 
diseases  arising  from  aberrant  gene  transcription  and,  importantly,  provides  the  first  scientific  tools  to  approach 
this  problem  (high  throughput  synthesis  and  screening  technology  for  DNA  binding  compounds). 
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Appendix.  None-attached.  Reprints  of  Publications  1  5  provided  in  prior  annual  reports. 

We  have  now  attached  reprints  of  publications  1-5  even  though  they  were  provided  in  earlier  progress  reports 
as  well. 

Contractural  Issues: 

Tasks  1-4  have  been  completed.  Reprints  of  Publications  have  been  attached. 

Technical  Issues: 

(1)  The  PI  did  introduce  and  develop  the  fluorescent  intercalator  displacement  (FID)  assay. 

(2)  A  detailed  review  of  this  technique  was  disclosed  in  publication  5. 
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Abstract — A  series  of  saturated  heterocyclic  analogues  of  distamycin  were  prepared  and  examined.  A  fluorescent  intercalator  dis¬ 
placement  (FID)  assay  conducted  on  p[dA}-p[dT]  DNA  to  obtain  C50  values  and  a  hairpin  deoxyoligonucleotide  containing  an  A/T- 
rich  binding  site  was  used  to  evaluate  DNA  binding  affinity.  It  is  observed  that  saturated  heterocycles  greatly  reduce  the  DNA  binding 
relative  to  distamycin.  ©  2002  Elsevier  Science  Ltd.  All  rights  reserved. 


Polyamides  composed  of  V-methylpyrrole  (Py), 
V-methylimidazole  (Im),  and  a  growing  set  of  structural 
analogues  bind  in  the  DNA  minor  groove  with  pre¬ 
dictable  sequence  selectivity  and  high  affinities.1  Several 
factors  contribute  to  the  success  of  polyamide  binding 
including  hydrogen  bonding,  curvature,  Van  der  Waals 
contacts,  and  charged  end  groups.  Each  component  has 
been  explored  through  the  preparation  of  analogous 
systems  designed  to  probe  the  magnitude,  requirements, 
and  relative  importance  of  each  feature.  Yet,  the  bind¬ 
ing  effectiveness  for  polyamides  lacking  the  it-system  of 
the  integral  heteroaromatic  rings  has  been  relatively 
unexplored.  Herein  we  report  the  synthesis  and  evalua¬ 
tion  of  a  series  of  saturated  cyclic  polyamides  1-5  (Fig.  1), 
based  on  the  known,  effective  DNA  minor  groove 
binding  agent  distamycin.2  Saturation  removes  the 
7t-system,  alters  the  conformation,  and  increases  the 
thickness  of  the  compound,  yet  maintains  the  relative 
structure  of  distamycin  and  was  used  to  assess  the 
importance  of  the  rc-system.  A  fluorescent  intercalator 
displacement  (FID)  assay  conducted  on  p[dA]-p[dT] 
DNA  to  obtain  C50  values  and  a  hairpin  deox¬ 
yoligonucleotide  containing  an  A/T-rich  binding  site 
was  used  to  evaluate  DNA  their  binding  affinity. 

A  solution-phase  synthesis  of  the  polyamides  was  con¬ 
ducted  using  a  series  of  l-(3-dimethylaminopropyl)-3- 


*Corresponding  author.  Tel.:  +  1-858-784-7522;  fax:  + 1-858-784- 
7550;  e-mail:  boger@scripps.edu 


ethylcarbodiimide  hydrochloride  (EDCI)  mediated  cou¬ 
pling  reactions  as  described  previously3  where  workup, 
isolation,  and  purification  could  be  addressed  princi¬ 
pally  by  liquid-liquid  acid-base  extractions.  Each  series 
of  analogues  (1-5)  are  discussed  separately  below. 

N-CBz  pyrrolidine  based  system  (la  and  b).  N-Boc 
deprotection  of  64  (HCl-EtOAc)  provided  the  amine  7 
as  the  hydrochloride  salt  and  methyl  ester  hydrolysis  of 
64(LiOH,  THF-MeOH-H20)  provided  carboxylic  acid 
8  (Scheme  1). 

After  deprotection,  7  and  8  were  coupled  to  produce 
dimer  95  (EDCI,  HOBt,  DMF,  25  °C,  13-20h,  66%) 


distamycin 


X 

la  NCBz 
2a  NH 
3a  O 
4a  CH2 
5a  NMe 

X 

1b  NCBz 
2b  NH 
3b  O 


H 


O  NH 


Figure  1.  Distamycin  and  saturated  heterocyclic  polyamides  1-5. 
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Scheme  1. 

(Scheme  2).  N-Boc  deprotection  of  9  (HCl-EtOAc,  25  °C, 

I  h)  and  coupling  with  8  (EDCI,  HOAt,  DMF,  25  °C, 
20  h,  73%)  provided  trimer  10.6  The  methyl  ester  of  10 
was  then  hydrolyzed  (2.5equiv  NaOH,  THF-H20,  1.5  h, 
91%)  to  give  the  free  carboxylic  acid  which  was  coupled 
with  3-aminopropionitrile  to  afford  ll7  (EDCI,  HOAt, 
DMF,  25  °C,  20  h,  75%).  End  group  functionalization  of 

II  was  accomplished  by  simultaneous  N-Boc  deprotec¬ 
tion  and  imidate  formation  (HCl-EtOH),  amidine  for¬ 
mation  (NH3-EtOH,  25  °C,  2-24  h),  and  TV-formylation 
(yV-formyl  imidazole,  THF-MeOH,  -40  °C,  2  h)  without 
purification  of  the  intermediates  yielding  la8  in  56%  over 
the  three  steps.  The  enantiomer  lb8  was  produced  in  the 
same  manner  starting  from  the  enantiomer  of  6.4 

Pyrrolidine  based  system  (2a  and  b).  Several  hydro- 
genolysis  conditions  for  N-CBz  deprotection  of  la  and  b 
were  examined  to  produce  2a  and  b  (Table  1,  entries  1-5) 


x  £ 

6  NCBz  8  NCBz 

12  O  14  O 

18  CH2  20  CH2 


Table  1.  Hydrogenolysis  conditions  for  la  using  10%  Pd/C 


Entry 

Conditions 

Result 

1 

H2,  MeOH,  25  °C,  24-48  h 

No  reaction 

2 

H2,  MeOH,  2  N  HCI,  25  °C,  24  h10 

Decomposition 

3 

4%  HCOOH,  MeOH,  25°C,  14h" 

Decomposition 

4 

H2,  acctonc-McOH,  25  °C,  24  h12 

No  reaction 

5 

H2,  2equiv  HOAc,  MeOH,  25  °C,  24  h 

No  reaction 

6 

H2,  lOequiv  TFA,  MeOH,  25 °C,  4h 

2a,  64% 

before  efficient  conditions  were  found  (H2,  10%  Pd/C, 
lOequiv  TFA,  MeOH,  25  °C,  4h,  entry  6).  Thus, 
although  N-CBz  deprotection  was  slow  and  typically 
problematic,  acid  catalysis  with  inclusion  of  TFA  pro¬ 
vided  2  in  good  conversions. 

Furan  based  system  (3a  and  b).  Transformation  of  the 
monomer  124  and  its  enantiomer  into  their  respective 
functionalized  trimers  3a9  and  b12  was  performed  using  the 
strategy  detailed  above  for  la  and  b  (Schemes  1  and  2). 
However,  the  tetrahydrofuran-based  compounds 
proved  highly  water  soluble.  Thus,  removal  of  excess 
starting  materials,  reagents,  and  side  products  by 
simple  acid-base  extraction  was  not  possible  and 
chromatographic  purifications  were  required. 

Cyclopentane  based  system  (4a).  Transformation  of  the 
monomer  184  into  4a  was  performed  uneventfully  using 
the  strategy  detailed  above  (Schemes  1  and  2). 

N-Me  pyrrolidine  based  system  (5a).  N-Boc  deprotec¬ 
tion  (HCl-dioxane)  of  244  provided  the  amine  25  as  the 
hydrochloride  salt  and  methyl  ester  hydrolysis  of  244 
(LiOH,  THF-MeOH)  provided  the  lithium  carboxylate 
2613  (Scheme  3). 

Following  the  strategy  detailed  above,  the  lithium  salt 
26  was  used  to  provide  trimer  28  (Scheme  4).  However, 
coupling  of  the  carboxylic  acid  derived  from  28  with 
3-aminopropionitrile  afforded  29  in  only  moderate  yield 


Table  2.  C50  values 


Compd 

Cso  (1*M) 

Distamycin 

0.46 

la 

4.70 

lb 

5.10 

2a 

6.37 

2b 

8.55 

3a 

5.40 

3b 

5.50 

4a 

7.91 

5a 

6.70 

p[dA]-p[dT]  8.8  pM  bp.  EtBr  4.4  pM. 


Scheme  2. 
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(20%,  Scheme  2).  Consequently,  an  alternative  con¬ 
vergent  synthesis  was  used  to  provide  29  (Scheme  4). 
Coupling  26  with  3-aminopropionitrile  (HATU, 
/-Pr2NEt,  DMF,  25  °C,  80%)  yielded  nitrile  monomer 
30,  which  was  subsequently  N-Boc  deprotected  (HC1- 
EtOAc)  and  coupled  with  the  carboxylic  acid  derived 
from  27  (LiOH,  THF-MeOH-H20)  using  standard 
coupling  conditions  (HATU,  /-Pr2NEt,  DMF,  25  °C, 
60%)  to  provide  29  in  a  more  satisfactory  yield.  Treat¬ 
ment  of  29  with  freshly  prepared  8  N  HCl-EtOH  led  to 
N-Boc  deprotection  and  imidate  formation.  Subsequent 
treatment  of  the  imidate  with  NH3-EtOH  provided 
the  corresponding  amidine  which  was  formylated  using 
A-formyl  imidazole  freshly  prepared  by  treating 
carbonyldiimidazole  with  formic  acid  to  provide  5a.14 

Initial  assessment  of  DNA  binding.  The  compounds  1-5 
were  initially  screened  for  DNA  binding  using  p[dA]- 
p[dT]  in  a  fluorescence  intercalator  displacement  (FID) 
assay.15  The  relative  binding  affinities  were  established 
by  monitoring  the  loss  of  fluorescence  derived  from 
titration  displacement  of  prebound  ethidium  bromide 
from  p[dA]-d[dT].  A  comparison  the  compound  con¬ 
centration  required  for  50%  displacement  of  the  ethi¬ 
dium  bromide  (C5o)  revealed  that  none  of  the  saturated 
analogues  compared  well  to  distamycin  (Table  2).  The 
best  of  the  analogues,  N-CBz  pyrrolidine  la,  required  a 
10-fold  greater  concentration  relative  to  distamycin 
(4.70  vs  0.46  pM),  while  the  remaining  analogues 
required  even  higher  concentrations  (5.10-8.55  pM). 
Interestingly,  one  stereochemical  configuration,  (/?)c. 
ring(‘S)N-ring  (the  a  analogues),  consistently  performed 
better  than  their  respective  enantiomers. 

Binding  to  a  hairpin  deoxyoligonucleotide  containing  an 
A/T-rich  binding  site.  In  an  attempt  to  assess  binding 
constant  comparisons  of  1-5,  a  FID  titration  assay  was 
performed  using  a  DNA  hairpin  with  an  A/T-rich 
binding  site  of  5  bp.  As  above,  the  analogues  exhibited 
low  binding  affinity  toward  the  hairpin  DNA,  yielding 
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flat  fluorescence  reduction  curves  too  shallow  for  accu¬ 
rate  Scatchard  analysis.16  Qualitatively,  their  behavior 
can  be  compared  using  the  ratio  of  fluorescence  present 
before  (F,0o%)  and  after  the  addition  of  one  equivalent 
of  agent  (Fi),17  using  the  Fi/F100%  value  (Table  3).  With 
distamycin,  a  F1/F]0o%  of  0.48  is  observed,  whereas 
much  higher  values  are  observed  with  the  saturated 
systems  (0.77-0.92).  The  trends  for  hairpin  and  p[dA]- 
p[dT]  DNA  binding  (Fi/F10o%  versus  C50)  are  similar 
with  the  N-CBz  derivatives  (la  and  b)  exhibiting  the 
better  binding  properties  than  the  remaining  analogues. 
However,  additional  insights  are  difficult  to  extrapolate 
due  to  the  low  binding  ability. 


Discussion 

One  major  impact  of  saturating  the  distamycin  pyrrole 
rings  with  2  or  5  is  the  introduction  of  three  additional 
basic  nitrogens.  Those  of  2  are  present  as  secondary 
amines  while  those  of  5  are  tertiary  amines.  Although 
not  observed,  one  might  have  suspected  that  pH  7  pro¬ 
tonation  of  these  amines  might  provide  positively 
charged  centers  that  would  benefit  from  stabilizing 
electrostatic  interactions  with  DNA  enhancing  their 
affinity.  In  addition,  the  comparable  behavior  of  2  and  5 
with  either  3  or  4,  suggest  that  the  presence  of  the  basic 
amines  do  not  enhance  or  inhibit  binding  relative  to  the 
impact  of  introduction  of  the  saturated  versus  unsaturated 


Table  3.  F]/F|oo%  values  of  distamycin  agents  1-5  with  hairpin  DNA 

5,CGA-A-A-A-A-Ca‘a 
3’  GC-T-H-H-GfrA 


Compd 

F,  /  F|0o%“ 

Curve  description 

Distamycin 

0.4S 

Good 

la 

0.88 

Flat 

lb 

0.77 

Flat 

2a 

0.90 

Flat 

2b 

0.88 

Flat 

3a 

0.90 

Flat 

3b 

0.90 

Flat 

4a 

0.92 

Flat 

5a 

0.83 

Flat 

aF|/F|0o%  is  the  ratio  of  fluorescence  before  (Fmo%)  and  after  the 
addition  of  1  equivalent  of  agent  (F,). 
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heterocycles.  Interestingly,  the  most  effective  analogue 
in  the  series  is  1  bearing  the  hydrophobic  N-CBz  pro¬ 
tecting  group.  However,  even  1  would  not  be  considered 
an  effective  DNA  binding  agent.  Finally,  the  all  carbon 
cyclopentyl-based  analogue  4  emerged  in  the  DNA  bind¬ 
ing  assays  as  the  poorest  analogue  in  the  series. 

Although  several  features  imparted  by  the  structural 
changes  could  be  responsible  for  the  poor  binding  of  1- 
5,  the  magnitude  of  the  reductions  is  notable.  From 
molecular  modeling  studies,  a  minor  groove  com¬ 
plementary  shape,  hydrogen  bonding,  and  stabilizing 
electrostatic  interactions  analogous  to  those  observed 
with  distamycin  are  possible  with  1-5  and  the  hydro- 
phobic  character  covered  by  the  series  brackets  that  of 
distamycin.  Nonetheless,  no  saturated  analogue  in  the  ser¬ 
ies  came  close  to  the  DNA  binding  affinity  observed  with 
distamycin  illustrating  the  importance  of  the  unsaturated 
heterocycles.18 
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Abstract — A  short  personal  perspective  on  the  development  of  an  approach  to  the  solution-phase  synthesis  of  combinatorial  libraries 
for  modulating  cellular  signaling  by  inhibiting,  promoting,  or  mimicking  protein-protein  or  protein-DNA  interactions  is  provided. 
©  2003  Elsevier  Science  Ltd.  All  rights  reserved. 


Over  the  course  of  the  last  several  years,  we  have  devel¬ 
oped  technology  for  the  solution-phase  preparation  of 
combinatorial  libraries  with  several  key  applications  in 
mind.  Because  this  work  was  necessarily  published  at 
different  intervals  across  a  range  of  chemical  or  biological 
journals,  the  strategic  decisions  we  made  in  pursuing  this 
work  and  the  intimate  relationship  between  the  technology 
development  and  the  intended  applications  is  not  likely 
to  be  clear  to  the  casual  reader.  Consequently,  I  would 
like  to  take  the  opportunity  of  this  Perspective  Articles 
series  to  summarize  our  contributions  to  solution-phase 
library  synthesis  highlighting  its  development  in  the 
context  of  our  intended  applications  for  modulating 
cellular  signaling  through  inhibiting,  promoting,  or 
mimicking  protein-protein  or  protein-DNA  inter¬ 
actions.  Despite  the  prevailing  bias  that  such  targets 
might  not  prove  viable  for  small  molecule  intervention, 
the  screening  of  our  libraries  against  each  such  target  has 
provided  the  first  small  molecule  modulators  of  the  pro¬ 
tein-protein  or  protein-DNA  interaction  and  validated 
the  targets  for  small  molecule  therapeutic  intervention. 


Solution-Phase  Combinatorial  Library  Synthesis 

Combinatorial  chemistry  has  undergone  rapid  develop¬ 
ment  and  has  provided  a  new  paradigm  for  drug  dis¬ 
covery.  As  a  consequence  of  its  extension  from  peptide 
and  oligonucleotide  synthesis,  most  approaches  have  relied 
on  solid-phase  synthesis  techniques.  A  complement  to 
adapting  solution-phase  chemistry  to  polymer-supported 
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combinatorial  synthesis  is  the  development  of  protocols 
for  solution-phase  combinatorial  synthesis.  We  intro¬ 
duced  a  simple  protocol  that  permits  the  multistep 
synthesis  of  chemical  libraries  employing  liquid-liquid  and 
liquid-solid  extractions  to  remove  unreacted  starting 
materials,  reagents  and  reagent  by-products,  providing 
the  purified  product  (>95%  pure)  irrespective  of  the 
reaction  efficiency  (Fig.  I).1-34  It  has  been  implemented 
in  formats  for  the  synthesis  of  individual  compounds1-4 
(1000-member  libraries),  modest  sized  libraries  composed 
of  small  mixtures  (1000-  to  10,000-member  libraries,  10-50 
compounds/mixture),6  or  combinatorially  assembled  to 
provide  large  libraries  including  positional  scanning  or 
deletion  synthesis  libraries8-13-21 -34  (25,000-  to  1,000,000- 
member  libraries,  100-28,000  compounds/mixture).7-8 
This  allows  the  protocol  to  be  adopted  in  a  format 
compatible  with  any  screening  objective.  Thus,  its 
implementation  is  convenient  for  either  lead  discovery 
or  lead  optimization  and  produces  the  library  members 
on  a  scale  (5-150  mg)  that  allows  their  repeated  use  in 
screening  without  resynthesis.  It  is  this  latter  feature 
along  with  its  technically  nondemanding  implementation 
that  we  consider  its  greatest  attributes.  We  presently 
have  40,000  compounds  in  the  small  (1000-member)  and 
medium-sized  (1000-10,000-member)  libraries  and 
efforts  are  ongoing  to  expand  this  to  approximately 
1,000,000  compounds.  Such  libraries  have  been  pre¬ 
pared  that  interfere  with  (antagonists)  or  mimic  (agonists) 
extracellular  or  intracellular  protein-protein  inter¬ 
actions,35"37  inhibit  intracellular  enzymes,  or  modulate 
protein-DNA  interactions.  The  chemistry  is  applicable 
to  non-natural  and  natural  product  scaffolds,  and  cyclic 
(depsi)peptides  and  possesses  a  scope  that  exceeds  what 
one  might  initially  imagine  based  on  its  simplicity. 
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Highlights  of  the  methodology  developed  for  the  pre¬ 
paration  of  chemical  libraries  include: 

•  A  general  solution-phase  technique  for  the  high- 
throughput  synthesis  of  libraries  in  a  variety  of 
formats  (individual  compounds,  small  mixtures, 
large  mixtures,  positional  scanning  libraries, 
deletion  synthesis  libraries)  was  introduced  that 
is  applicable  to  both  lead  identification  and  lead 
optimization. 

•  Liquid-liquid  or  liquid-solid  (ion-exchange 
resin)  extractions  were  developed. 1,2,11 

•  Immobilized  reagents  for  use  in  the  solution- 
phase  synthesis  of  libraries  were  intro¬ 
duced.2’10^12 

•  The  first  use  of  size  exclusion  chromatography 
(filtration)  for  the  high-throughput  purification 
and  isolation  of  chemical  libraries  was  dis¬ 
closed.1 1 

•  The  first  application  of  an  intermolecular  olefin 
metathesis  reaction  in  the  synthesis  of  chemical 
libraries  was  disclosed.5’7-9 

•  The  approach  is  applicable  to  the  multistep 
synthesis  of  selected  natural  products  and  this  was 
highlighted  with  a  2640-membered  distamycin 
library,20  132-membered  CC-1065  library,24  40- 
membered  HUN-7293  library,18  and  a  prototype 
triostin  A  synthesis.22 

•  Deletion  synthesis  deconvolution  of  mixture 
libraries  was  introduced  and  assessed.8 

•  The  performance  of  positional  scanning  decon¬ 
volution  was  and  continues  to  be  compared 
against  deletion  synthesis  deconvolution,  small 
mixture  screening,  and  individual  compound 
screening.8'13’21’34 

•  Convergent  versus  divergent  strategies  of  library 
synthesis  were  introduced  and  developed  (only 
applicable  to  solution-phase,  not  solid-phase, 
synthesis).7-12 

•  Agonists  from  antagonists  design  concepts 
implemented.32 

•  A  rapid,  high-throughput,  high-resolution  screen 
for  DNA  binding  affinity  and  selectivity  appli¬ 
cable  for  use  in  assaying  libraries  was  developed 
(FID  assay).20,23’25-27 

•  Current  archived  chemical  libraries  contain 
>40,000  compounds  that  are  available  for  con¬ 
tinued  screening  in  new  assays. 

The  approach  avoids  the  disadvantages  of  solid-sup- 
ported  synthesis  including  its  more  restrictive  scale,  the 
required  functionalized  substrates  and  solid  supports, 
compatible  spacer  linkers,  and  the  requirements  for 
orthogonal  attachment/detachment  chemistries.  It  does 
not  require  specialized  protocols  for  monitoring  each 
step  of  multistep  syntheses,  allows  the  purification  of 
intermediates,  and  provides  the  final  pure  products 
directly  for  use  in  binding  or  functional  assays.  We 
described  extensions  of  these  studies  for  generating 
symmetrical  or  unsymmetrical  chemical  libraries  sui¬ 
table  for  probing  receptor  and  protein  homo-  and  het¬ 
erodimerization  events  (Fig.  2).  Thus,  the  preparation 
and  dimerization  linkage  of  iminodiacetic  acid  diamides 


can  be  conducted  in  a  reaction  sequence  that  requires 
only  three  steps.  In  addition  to  the  multiplication  of  the 
diversity  that  arises  through  the  combinatorial  dimer¬ 
ization  linkage  of  the  iminodiacetic  acid  diamides,  the 
solution-phase  synthesis  of  the  intermediates  permits 
their  direct  linkage  which  would  be  precluded  by  solid- 
phase  synthesis  techniques.  As  such,  the  strategy  is 
uniquely  suited  for  taking  advantage  of  such  dimeriza¬ 
tion  (convergent)  strategies  utilizing  a  limited  number  of 
synthetic  steps.  This  modular  approach  to  the  genera¬ 
tion  of  libraries  is  especially  well-suited  for  the  discovery 
of  antagonists  or  agonists  of  receptor  and  protein 
homo-  and  heterodimerization.  Simple  binders  (i.e.,  3) 
can  serve  as  antagonists  of  ligand-induced  receptor  or 
protein  dimerization.  Covalently  linked  symmetrical 
dimers  (i.e.,  4)  can  be  used  to  promote  receptor  or  pro¬ 
tein  homodimerization  whereas  unsymmetrical  dimers 
can  be  utilized  to  promote  receptor  or  protein  hetero¬ 
dimerization.  Thus,  both  antagonists  and  agonists  may 
be  developed  depending  on  the  therapeutic  applica¬ 
tion.41,45 

Target  Protein-Protein  Interactions  Mediating  Cellular 
Signaling 

Cell  growth,  differentiation,  migration,  and  apoptosis 
are  regulated  in  part  by  growth  factors  or  cytokines. 
These  factors  are  unable  to  penetrate  the  cell  membrane 
and  exert  their  effects  by  binding  to  cell  surface  recep¬ 
tors.  In  many  instances,  such  receptors  are  activated  by 
ligand-induced  dimerization  or  oligomerization.38-44  In 
addition,  several  components  of  the  intracellular  signal 
transduction  pathways  are  also  regulated  by  dimerization. 
For  instance,  certain  cytoplasmic  signal  transduction 
molecules  dimerize  after  activation,  and  the  active  form 
of  a  transcription  factor  is  often  a  dimer.42-45  Thus,  protein 
dimerization  has  emerged  as  a  general  mechanism  for 
the  initiation  and  downstream  regulation  of  signal 
transduction.  The  targets  we  are  addressing  constitute 
prototypical  examples  of  these  events  in  signal  transduc¬ 
tion.  Thus,  the  targets  being  pursued  were  chosen  not 
only  for  their  therapeutic  importance,  but  also  because 
each  constitutes  a  distinct  stage  for  modulating  cellular 
signaling  by  controlling  protein-protein  interactions. 

Effective  inhibitors  of  angiogenesis  (new  blood  vessel 
growth)  and  tumor  growth  have  been  discovered  that 
act  by  disruption  of  the  binding  of  matrix  metallo¬ 
proteinase  2  (MMP2)  to  the  cell  surface  integrin  avp3 
validating  the  target  for  therapeutic  intervention  for  the 
treatment  of  cancer  and  providing  the  first  small  mol¬ 
ecule  lead  structures  (Fig.  3).30,31  The  first  small  molecule 
inhibitors  of  Myc/Max  dimerization  have  been  dis¬ 
covered  that  act  at  the  bHLHZip  dimerization  interface 
enlisting  a  novel  FRET  assay  that  were  shown  to  disrupt 
the  binding  of  this  oncogenic  transcription  factor  to 
DNA  and  to  inhibit  Myc  induced  conversion  of  normal 
to  transformed  fibroblasts  validating  an  important  small 
molecule  target  for  the  treatment  of  cancer  (Fig.  4). 33 
Promising  inhibitors  of  LEF-l/P-catenin  mediated  gene 
transcription  have  been  identified  in  a  luciferase  repor¬ 
ter  assay  (TOPFLASH).29  These  inhibitors  and  their 
mechanism  of  action  (inhibition  of  LEF-l/P-catenin 
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Figure  1.  Solution-phase  synthesis  of  libraries. 


Figure  2.  Homo-  or  heterodimerization  of  libraries  for  protein-protein  dimerization. 


binding)  are  being  more  fully  explored  with  the  devel¬ 
opment  of  a  LEF-l/p-catenin  FRET  assay.  The  first 
inhibitors  of  the  binding  of  the  signaling  adapter  protein 
Paxillin  to  the  short  cytoplasmic  tail  of  the  integrin  oc4 
(a4Pi=VLA4,  Very  Late  Antigen  4)  have  been  dis¬ 
covered  and  shown  to  inhibit  cell  migration  validating  a 
new  target  for  the  treatment  of  chronic  inflammatory 
diseases,  asthma,  and  multiple  sclerosis  (Fig.  5).34 
Finally,  the  remarkable  discovery  of  relatively  small 
erythropoietin  (EPO)  agonists  that  act  by  promoting 
homodimerization  of  the  cell  surface  receptor  (Fig.  6) 


has  been  made  in  an  approach  that  could  provide  con¬ 
ventional  therapeutic  replacements  for  the  recombinant 
human  protein  (7  billion/yr  market)  used  for  the  treat¬ 
ment  of  anemias  resulting  from  cancer,  AIDS  and  other 
clinical  disease  states.32 

The  targets  were  carefully  chosen  not  only  for  their 
therapeutic  importance,  but  such  that  each  represents  a 
different  prototypical  extracellular  or  intracellular  sig¬ 
naling  event  involving  protein-protein  or  protein-DNA 
interactions: 
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Library  of  77  analogues  provided  an  improved  lead  structure 
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Figure  3.  Disruption  of  angiogenesis  by  blocking  MMP2  binding  to  integrin  avp3. 
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•  Promote  a  cell  surface  receptor  homodimeriza¬ 
tion  required  for  activation  (EPO  agonist) 

•  Inhibit  an  extracellular  cell  surface  integrin- 
protein  interaction  (MMP2/ocvp3). 

•  Inhibit  an  intracellular  cell  surface  integrin-pro- 
tein  interaction  (Paxillin/a4Pi). 

•  Inhibit  the  intracellular  heterodimerization  of  a 
transcription  factor  (Myc-Max,  LEF-1  /P-catenin). 

•  Inhibit  a  protein-DNA  interaction  of  a  tran¬ 
scription  factor  (LEF-1 /p-catenin). 

Protein-DNA  Interactions:  High-Throughput  Synthesis 
and  Screening  Applied  to  the  Discovery  of  Biologically 
Active  DNA  Binding  Agents 

We  have  also  described  the  high-throughput  synthesis 
and  screening  of  DNA  binding  compounds  l9~28  that  are 
related  to  our  interests  in  understanding  and  exploiting 
their  properties.  The  approach  integrates  the  solution- 


phase  techniques  for  the  synthesis  of  libraries  with  a 
technique  we  recently  introduced  for  rapid  high- 
throughput  screening  for  DNA  binding  affinity  or 
sequence  selectivity  (Fig.  7).20,23  These  techniques  can 
be  combined  to  rapidly  explore  and  define  the  structural 
features  responsible  for  the  sequence  selective  DNA 
binding  properties  of  known  agents,  to  discover  new 
paradigms  for  small  molecule  recognition  of  DNA  (new 
bp  codes),  and  to  screen  for  compounds  that  selectively 
target  consensus  sequences  of  transcription  factors  for 
controlling  aberrant  gene  transcription  (i.e,  LEF-l/P- 
catenin). 


Highlights  of  these  efforts  to  date  include: 

•  Development  of  a  high  resolution,  high 
throughput  FID  assay  for  assessing  DNA  bind¬ 
ing  affinity  or  selectivity.20'23’25 
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’  Screen  conducted  with  novel  FRET  assay 

>  >40,000  compounds  screened  in  FRET  assay 

•  Inhibition  of  Myc-Max  dimerization:  FRET  assay 

>  Inhibition  confirmed  in  ELISA  binding  assay 

>  Inhibition  of  Myc-Max  DNA  binding:  gel  shift  assay 

•  Inhibition  of  c-Myc  induced  transformation  in 

cellular  functional  assay  validating  the  target  for 
small  molecule  antineoplastic  intervention _ 


0 


•  Gain  of  Myc  function  is  seen  in  many  human  tumors: 
constituitive  and  over  expression 

•  All  known  functions  of  Myc  require  dimerization  with  Max 

•  DNA  Binding  of  Myc-Max  dimers  activates  transcription 
of  Myc  target  genes 


Boc 


C02ffiu 


CCfiQC 

CONHMe  S02 
CI'-^y-Br 

First  small  molecule  's_^ 
inhibitor  of  Myc-Max  Cl 
dimerization 


Figure  4.  Inhibition  of  Myc-Max  (transcription  factor)  heterodimerization  and  abberant  gene  transcription:  antitumor  target  validation. 


a4Pi  =  VLA-4  (Very  Late  Antigen  4) 


Chem.  Biol.  2002,  9,1219. 

•  >40,000  compounds  screened  for  inhibition  of  Paxillin/rx4  binding 

•  ELISA  assay  for  Paxillin  binding  to  immobilized  a4  cytoplasmic  tail 

•  Lead  emerged  from  conventional  and  positional  scanning  libraries 

•  20  derivatives  defined  key  structural  features 
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validating  target  for  treatment  of  chronic  inflammatory  diseases 


Figure  5.  Inhibition  of  cell  migration  by  blocking  intracellular  Paxillin/a4  binding. 
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>  >100,000  molecules  screened  for  EPOR  binding 
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•  Library  of  1 15  A  analogues  provided  an  improved  lead  structure 

■  B  Subunit  features  contributing  to  activity  identified:  77  analogues 

■  Library  of  70  C  analogues  provided  an  improved  lead  structure 

•  Three  1000  member  libraries  with  alternative  templates  examined 

•  Higher  order  IDAs  examined:  binders  examined  in  functional  assay 


DNA 


Induction  of 

h  n 

|  T«r,scnptton 

Cytokine-Responsive 
Promoter  Clement 


•  Agonist  from  antagonists  strategy  delineated 

•  Convergent  versus  divergent  synthetic  strategy 

•  rhEPO  ($7  billion/yr  sales)  is  used  for  the  treatment  of 
anemias  including  those  resulting  from  chemotherapy 

Figure  6.  EPO  agonists  (mimics)  that  function  by  promoting  EPOR  dimerization. 


•  EPO  agonist  active  in 
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Figure  7.  Solution-phase  synthesis  of  DNA  minor  groove  binding  ligands  and  FID  assay  for  DNA  binding  affinity  and  selectivity. 


•  Disclosure  of  some  of  the  first  libraries  of  DNA 
binding  compounds  (distamycin,  CC- 1065).20- 22  24 

•  Discovery  of  a  general  hairpin  versus  extended 
DNA  binding  of  a  substituted  P-alanine  linked 
polyamide.26 

•  The  first  characterization  of  cooperative  exten¬ 
ded  2:1  side-by-side  parallel  (vs  antiparallel) 
DNA  binding  with  a  novel  class  of  iminodiacetic 
acid  (IDA)  linked  polyamides.27 
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I  he  high-throughput  synthesis  and  screening  of  compound  libraries 
hold  tremendous  promise  for  drug  discovery  and  powerful  methods 
for  both  solid-phase  and  solution-phase  library  preparation  have 
been  introduced.  The  question  of  which  approach  (solution-phase 
versus  solid-phase)  is  best  for  the  preparation  of  chemical  libraries 
has  been  replaced  by  which  approach  is  most  appropriate  for  a 
particular  target  or  screen.  Herein  we  highlight  distinctions  in  the 
two  approaches  that  might  serve  as  useful  considerations  at  the  onset 
of  new  programs.  This  is  followed  by  a  more  personal  account  of  our 
own  focus  on  solution-phase  techniques  for  the  preparation  of 
libraries  designed  to  modulate  cellular  signaling  by  targeting 
protein-protein  or  protein-DNA  interactions.  The  screening  of  our 
libraries  against  a  prototypical  set  of  extracellular  and  intracellular 
targets,  using  a  wide  range  of  assay  formats,  provided  the  first  small- 
molecule  modulators  of  the  protein-protein  interactions  studied, 
and  a  generalized  approach  for  conducting  such  studies. 
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1.  Introduction 

Combinatorial  chemistry  has  undergone  rapid  development 
and  has  provided  an  important  approach  for  target  validation 
and  drug  discovery.  As  a  consequence  of  its  extension  from 
peptide  and  oligonucleotide  synthesis,  the  majority  of 
approaches  have  relied  on  solid-phase  synthesis  techniques 
and  many  reviews  have  treated  this  emerging  science.11"161  A 
complement  to  adapting  solution-phase  chemistry  to  polymer- 
supported  combinatorial  synthesis  is  the  development  of 
protocols  for  solution-phase  combinatorial  synthesis.117'311  A 
number  of  approaches  have  been  introduced  for  this  purpose 
including  dynamic  combinatorial  chemistry,132,331  soluble  poly¬ 
mer-supported  combinatorial  synthesis  (pioneered  by  Janda 
et  al.), 134-3111  fluorous-phase  synthesis  (FPS)139,401  and  other  tag¬ 
ging  strategies,1411  precipitation  techniques,142-151  and  resin- 
bound,1*"521  soluble  polymer-bound,1531  and  fluorous-phase 
bound130,401  catalysts,  reagents,  or  scavenging  reagents.146-521 
Preceding  this  work,  we  introduced  a  simple  approach  that 
permits  the  multistep  solution-phase  synthesis  of  chemical 
libraries  employing  liquid-liquid  and  liquid-solid  (ion- 
exchange  resin)  extractions  to  remove  residual  starting  materi¬ 
als,  reagents,  and  reagent  byproducts  providing  the  purified 
product  (>95%  pure)  irrespective  of  the  reaction  effi¬ 
ciency.157-71’1  It  has  been  implemented  in  formats  for  the  parallel 
synthesis  of  individual  compounds162  61,1  (1000-member  libra¬ 
ries),  modest-sized  libraries  composed  of  small  mixtures 
(1000-10000-member  libraries,  10-50  compounds  per  mixture) 
including  positional-scanning  and  deletion-synthesis  libra¬ 
ries,16,3-651  or  combinatorially  assembled  to  provide  large  libraries 
(25000-1000000-member  libraries,  10000-28000  compounds 
per  mixture).1651  This  flexibility  allows  the  method  to  be  adopted 
in  a  format  compatible  with  any  screening  objective.  Thus,  its 
implementation  is  convenient  for  either  lead  discovery  or  lead 
optimization  and  produces  the  library  members  on  a  scale  that 
allows  their  repeated  use  in  screening  without  resynthesis.  Such 
libraries  have  been  prepared  in  which  members  have  been 
found  to  interfere  with  (antagonists)  or  mimic  (agonists) 


extracellular  or  intracellular  protein-protein  interactions, 
inhibit  intracellular  enzymes,  or  modulate  intranuclear  pro¬ 
tein-DNA  interactions.  The  chemistry  has  proven  applicable  to 
natural  product  scaffolds,  and  cyclic  (depsi)peptides  and 
possesses  a  scope  that  exceeds  what  one  might  initially  imagine 
based  on  its  simplicity.157-991 

This  technology  is  being  developed  as  an  integral  compo¬ 
nent  of  efforts  to  modulate  cellular  signaling  by  inhibiting, 
promoting,  or  mimicking  protein-protein  or  protein-DNA 
interactions,  especially  those  that  lack  existing  small-molecule 
leads  and  target  structural  information.  Despite  the  prevailing 
bias  that  such  targets  might  not  prove  viable  for  small- 
molecule  intervention,1100,1011  the  screening  of  our  libraries 
provided  the  first  small-molecule  modulators  of  the  protein- 
protein  interactions  being  studied,  validated  the  targets  for 
therapeutic  intervention,  and  provided  a  generalized 
approach  to  conducting  such  investigations.11021  Herein,  we 
review  these  efforts  by  first  detailing  methodology  introduced 
for  the  solution-phase  synthesis  of  chemical  libraries  followed 
by  a  more  personal  summary  of  our  applications  in  the 
validation  of  a  series  of  new  therapeutic  targets. 

2.  Solution-Phase  versus  Solid-Phase  Combinatorial 
Chemistry 

2.1.  Key  Distinctions  of  Solution-Phase  Combinatorial 

Approaches 

Solid-phase  combinatorial  chemistry  has  been  widely 
implemented  for  lead  discovery  and  lead  optimization 
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because  of  the  simplicity  by  which  the  compounds  can  be 
isolated  once  they  are  attached  to  solid  supports.  Several 
advances  in  the  implementation  of  solid-phase  library  syn¬ 
thesis  paved  the  way  for  its  widespread  acceptance  including 
the  spatially  arrayed  multipin  peptide  synthesis  of  Geysen 
et  al., 1103,1041  the  tea-bag  method  of  peptide  synthesis  intro¬ 
duced  by  Houghten  et  al., 11051  phage  display  from  Smith 
et  al. , 1106 1071  the  spot  or  disc  synthesis  developed  by  Frank 
et  al., 1108  1091  and  especially  the  (portion-mixing)  split-and-mix 
solid-phase  synthesis  on  beads  introduced  by  Furka 
et  al. 1110-1121  and  also  disclosed  by  Houghten  et  al. 11131  (divide, 
couple,  and  recombine),  and  Lam  et  al.1"4-1161  (split  synthesis), 
and  Affymax’s  light-directed,  spatially  addressable,  immobi¬ 
lized  parallel  synthesis.11171  Further  promoting  the  widespread 
acceptance  of  split-and-mix  solid-phase  synthesis,  several 
identification  techniques  were  introduced  including  itera¬ 
tive1"0118,1191  and  recursive11201  deconvolution,  or  nucleotide- 
'[121.222]  peptide-,1123-1251  chemical-  ,112'’-1281  radiofrequency- 
[129.130]  coior.(l|31l  anc)  shape-encoded11321  solid-supported 
libraries.  Similarly,  advances  in  microarray  synthesis  continue 
to  improve  large-scale  spatially  arrayed  parallel  synthesis  on  a 
variety  of  solid  supports.1133-1361  In  addition  to  the  many  merits 
of  solid-phase  synthesis,  the  isolation  of  the  immobilized 
product  by  simple  filtration  permits  the  use  of  large  reagent 
excesses  to  effect  high-yield  conversions  for  each  of  the  steps 
(Table  1).  However,  its  scale  can  be  restricted  by  the  amount 
of  required  solid  support  and  its  loading  capacity  and  the 
production  of  significant  quantities  of  each  library  member 
can  be  cumbersome  and  expensive  for  large  libraries.  Its  use 
requires  functionalized  substrates  and  solid  supports  for 
attachment,  compatible  spacer  linkers,  orthogonal  attach¬ 
ment  and  detachment  techniques  often  with  the  release  of 
spectator  functional  groups,  specialized  methods  for  monitor¬ 
ing  the  individual  steps  of  a  multistep  synthesis  often 
including  orthogonal  capping  strategies  for  blocking  residual 
substrate,  and  does  not  permit  the  purification  of  resin-bound 
intermediates.  This  last  feature  necessarily  produces  the 
released  product  of  a  multistep  sequence  in  an  impure  state 
and  requires  that  each  step  of  a  multistep  synthesis  on  each 
substrate  proceed  with  high  efficiency.  Advances  addressing 
each  of  these  limitations  have  been  made  which  continue  to 
improve  the  implementation  of  solid-phase  synthesis.  Some  of 
the  most  notable  introductions  include  traceless  linkers,1137-1391 
the  resin  release  of  product  only  upon  successful  synthe- 


Tabler  Advantages  (+)  and  disadvantages  (-)  of  solution-phase  and 
solid-phase  combinatorial  synthesis. 

Solid-phase  Solution-phase 


+  Simple  removal  of  excess  reagents 
and  reactants 

+  Automation  straightforward 

+  Split  and  mix  synthesis  simplifies 
preparation  of  large  libraries 

+  Pseudo-dilution  effects 

-  Adapt  chemistry  to  solid  phase 
and  develop  linking/cleavage 
strategies 

-  Reaction  monitoring  difficult 

-  No  purification  possible 

—  Linear,  but  not  convergent  syn¬ 
thesis 

—  Limited  scale 

—  Cannot  conduct  mixture  synthesis 


-  Removal  of  excess  reagents 
and  reactants  limits  scope 

+  Automation  by  liquid- 
liquid  techniques 

+  Mixture  synthesis  simpli¬ 
fies  preparation  of  large 
libraries 

-(-  Chemistry  not  limited  by 
support 

+  Avoids  extra  steps  for  link¬ 
ing/cleavage 

4-  Monitor  by  traditional 
techniques 

+  Purification  possible  after 
each  step 

+  Convergent  or  linear  syn¬ 
thesis 

+  Unlimited  amounts 
(scales)  available 

-I-  Mixture  or  parallel  synthe¬ 
sis 


sis,11401411  resin-capture  of  only  the  desired  reaction  prod¬ 
ucts,11421  the  introduction  of  safety-catch  linker1143-1451  and 
backbone  amide  linker1146  1471  strategies,  multidirectional 
resin-cleavage  methods,1148  1491  and  improved  resin  properties 
and  loading  capacities.134-381  Limitations  that  cannot  be  altered 
are  that  solid-phase  synthesis  is  incompatible  with  the  use  of 
heterogeneous  catalysts  or  reagents,  is  necessarily  restricted 
to  a  linear  (as  opposed  to  convergent)  synthetic  strategy,  is 
not  easily  amenable  to  the  synthesis  of  mixture  libraries,  and 
is  not  amenable  to  selection  or  dynamic  library  screening141 
involving  target-assisted  synthesis.1150-1541  These  latter  limita¬ 
tions  are  discussed  in  more  detail  in  the  following  sections. 

At  the  time  we  initiated  our  efforts,  no  reports  of  the 
extension  of  the  solid-supported  synthesis  of  chemical 
libraries  to  solution-phase  libraries  had  been  described  and 
a  series  of  related  studies  including  those  of  Smith  et  al., 11551 
Rebek,  et  al., 1156-1591  Pirrung  et  al.,11601  and  Nielsen  et  al.,11611 
and  the  popularization  of  multicomponent  solution-phase 
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reactions  by  Ugi  et  al.1'621  and  Armstrong  et  al.11631  were 
disclosed  around  the  time  of  our  initial  contributions.  The 
distinction  of  our  work  being  that  it  constituted  a  multistep, 
not  a  single-step,  strategy  for  the  solution-phase  synthesis  of 
chemical  libraries. 

Given  that  both  solution-phase  and  solid-phase  sample 
manipulation  are  easily  automated,  the  major  limitation  to 
the  solution-phase  synthesis  of  chemical  libraries  is  the 
isolation  or  purification11641  of  the  library  members.  If  the 
advantages  of  sample  isolation  characteristic  of  solid-phase 
synthesis  can  be  incorporated  into  a  solution-phase  synthesis, 
its  less  limiting  scale,  expanded  repertoire  of  chemical 
reactions,  direct  production  of  soluble  intermediates  and 
final  products  for  purification  or  assay,  and  the  lack  of 
required  linking,  attachment  and  detachment,  or  capping 
steps  make  solution-phase  library  synthesis  especially  attrac¬ 
tive  (Table  1).  It  is  the  only  approach  readily  amenable  to 
convergent  synthetic  strategies,  the  synthesis  of  mixture 
libraries,  or  use  of  dynamic  libraries.  Consequently,  we 
considered  a  number  of  techniques  that  are  available  for 
simplifying  the  isolation  of  solution-phase  samples  and 
focused  on  one  of  the  most  attractive  involving  isolation 
and  purification  by  liquid-liquid  or  solid-liquid  (ion- 
exchange  resins)  acid-base  extractions.  Descriptions  of  the 
initial  libraries  generated  and  their  subsequent  extensions, 
which  included  the  introduction  of  solid-supported 
reagents,157"631  solid-supported  workup  reagents  (scaveng¬ 
ers,11651  ion-exchange  resins157'591)  including  the  introduction 
of  immobilized  scavengers  and  reagents  conducted  within 
CombiChem  in  collaboration  with  L.  Caporale, 11651  purifica¬ 
tion  by  size-exclusion  chromatography  filtration,162,631  and  the 
generation  of  non-amide  libraries  have  been  detailed.1711 


2.2.  Synthetic  Strategies:  Linear,  Convergent,  and  Divergent 

Synthesis 

Traditional  organic  synthesis  has  provided  two  widely 
recognized  strategies  for  synthetic  design:  linear  versus 
convergent  synthesis  (Scheme  1).  The  merits  of  a  convergent 
synthesis  are  widely  recognized;  it  typically  provides  a  shorter 
(number  of  steps),  more  efficient  (higher  overall  yield),  and 
more  effective  (larger  sample  size)  approach  that  is  techni¬ 
cally  simpler  (material  balance)  to  execute  than  a  linear 
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Scheme  i.  Linear  versus  convergent  and  divergent  versus  parallel  syn¬ 
thesis. 


synthesis.  Since  solid-phase  synthesis  is  limited  to  a  linear 
synthetic  strategy,  it  is  not  surprising  that  it  did  not  find 
widespread  use  in  traditional  organic  synthesis.  In  fact,  the 
applications  where  solid-phase  synthesis  did  find  widespread 
acceptance  prior  to  the  advent  of  combinatorial  chemistry 
(solid-phase  peptide  and  oligonucleotide  synthesis)  are  those 
ideally  suited  for  linear  synthesis  involving  the  repetitive  use 
of  a  single  highly  optimized  reaction.  A  third,  less  widely 
recognized,  synthetic  strategy  is  the  divergent  synthesis  where 
a  class  of  compounds  is  prepared  from  a  single  common 
intermediate.  Although  widely  used,  the  formalization  (def¬ 
inition)  of  this  strategy  was  first  detailed  along  with  a  strategy 
for  divergent  aromatic  annotations.1166"1681  Today,  the  diver¬ 
gent  introduction  of  diversity  is  the  basis  of  most  combina¬ 
torial  chemistry  strategies.11691  The  advantages  of  such  a  late- 
stage  divergent  approach  relative  to  the  parallel  synthesis  of  a 
series  of  related  compounds  are  analogous  to  those  of  a 
convergent  versus  linear  synthesis  (more  efficient,  larger 
sample  size,  material  balance). 

The  strategies  are  not  mutually  exclusive  for  diversity- 
oriented  synthesis.  Either  a  linear  or  convergent  synthetic 
strategy  can  be  employed  (Figure  1).  Solid-phase  synthesis  is 
necessarily  limited  to  a  linear  synthetic  strategy  whereas 
solution-phase  synthesis  may  be  adapted  for  either  a  linear  or 
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Figure  i.  Convergent  versus  linear  synthesis.  A)  Linear,  divergent  syn¬ 
thesis  with  multiplication  of  diversity  (solid  or  solution  phase),  B)  con¬ 
vergent  synthesis  with  multiplication  of  diversity  (solution  phase  only). 
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convergent  synthetic  strategy.  Moreover,  the  convergent 
synthesis  is  especially  well  suited  for  solution-phase  synthesis 
and  would  be  precluded  by  solid-phase  techniques  where  the 
combining  components  are  on  mutually  exclusive  solid 
phases. 

From  the  start,  an  important  element  of  our  efforts  to 
probe  protein-protein  or  protein-DNA  interactions  was  the 
ability  to  screen  for  binding  agents  for  each  macromolecular 
partner  (screen  for  antagonists)  which  could  then  be  linked  to 
promote  the  protein-protein  interaction  in  the  absence  of  an 
endogenous  ligand  (e.g.,  agonists  for  receptor  homo-  and 
heterodimerization;  Figure  1).  This  was  the  genesis  of  the  use 
of  the  iminodiacetic  acid  template  for  the  synthesis  of 
libraries  and  it  has  provided  the  first  generalizable  approach 
for  the  discovery  or  development  of  agonists  from  antago¬ 
nists.161  f’3,67,7'1,17n'171'  This  convergent  mix-and-match  dimeriza¬ 
tion  linkage  of  macromolecular  binders  was  effected  by  a 
variety  of  techniques  including  diamide  formation,  olefin 
metathesis)60,61,641  palladium-catalyzed  biaryl  coupling)621  and 
Stille  acetylenic-aryl  couplings16’1  and  can  only  be  accommo¬ 
dated  by  solution-phase,  not  solid-phase  techniques. 

An  additional  example  of  the  superimposition  of  a 
divergent,  diversity-oriented  synthesis  onto  a  convergent 
synthetic  strategy  may  be  found  in  work  that  provided  a 
library  of  HUN-7293  analogues  (1  and  2,  Scheme  2)  which 
was  used  to  explore  each  structural  detail  of  the  natural 
product)761  As  discussed  in  Section  6.1,  a  solid-phase  synthesis 
of  the  library  would  require  a  parallel  linear  synthesis  of  each 


4  Tetrapeptide  Modified  3  Tripeptide 

tetrapeptides 


Scheme  2.  Convergent  synthetic  strategy  for  the  generation  of  a  library 
of  HUN-7293  analogues.  Boc  =  tert-butoxycarbonyl  X  =  0,  NH. 


analogue,  whereas  the  implementation  of  a  solution-phase 
convergent  synthesis  allowed  the  single  synthesis  of  the  key 
tri-  and  tetrapeptide  subunits  of  the  natural  product  (3  and  4) 
and  their  respective  combination  with  diversified  tetra-  and 
tripeptides  containing  single-point  changes  in  the  HUN-7293 
structure  (Scheme  2).  Thus,  the  advantages  commonly  asso¬ 
ciated  with  a  convergent  synthesis  were  combined  with  those 
of  a  divergent  synthesis  to  provide  a  family  of  closely  related 
structures. 


2.3.  Solid-Phase  Split-and-Mix  versus  Solution-Phase  Mixture 

Synthesis  and  Deconvolution  ( Identification )  Methods 

Libraries  of  modest  size  can  be  prepared  easily  by  either 
parallel  solid-phase  or  solution-phase  synthesis,  both  of  which 
can  provide  individual  compounds.  The  advent  of  split-and- 
mix  solid-phase  synthesis  and  the  related  tea-bag  methods 
which  provide  one-compound-per-bead  libraries  made  large 
libraries  accessible  by  solid-phase  synthesis.  The  concurrent 
introduction  of  the  identification  (tagging)  methods  detailed 
in  Section  2.11121-1321  greatly  enhanced  the  value  of  such 
strategies  by  providing  an  immediate  identification  of  screen¬ 
ing  hits.  The  most  valuable  of  these  encoding  technologies 
being  those  that  do  not  add  complexity  to  the  library  synthesis 
(Table  2). 

Mixture  synthesis  provides  for  solution-phase  synthesis 
what  split-and-mix  synthesis  provided  for  solid-phase  techni¬ 
ques  (Table  2))1721  Unlike  solid-phase  synthesis  where  the 
polymer-bound  substrate  must  be  the  stoichiometrically 
limiting  reaction  partner,  either  the  substrate  or  the  reacting 
attachment  groups  may  be  limiting  in  solution-phase  chemis¬ 
try.  This  situation  dictates  the  use  of  split-and-mix  synthesis  in 


Table  1:  Comparison  of  library  techniques. 


Technique 

Single  com- 

Speed  of 

SAR1’' 

Utility 

pound 
or  mixture 

synthesis 

retrieval 

parallel  synthesis 

single 

slow 

fast 

lead  optimi¬ 
zation 

mixture  synthesis 

mixture 

fast 

slow 

lead  identifi- 

(scanning/dele¬ 
tion  deconvolu- 

(fast) 

(fast) 

cation 

tion) 

parallel,  spacial 

single 

moderate 

fast 

lead  identifi- 

array 

cation 

or  optimiza¬ 
tion 

split  and  mix 

mixture 

moderate 

slow 

lead  identifi- 

(one  com- 

cation 

pound  per 

or  optimiza- 

bead) 

tion 

encoded  split  and 

mixture 

moderate 

moderate 

lead  identifi- 

mix 

(one  com- 

cation 

pound  per 

or  optimiza- 

bead) 

tion 

mix  and  sort 

single 

moderate 

fast 

lead  identifi- 

(microreactors) 

cation 

or  optimiza¬ 
tion 

[a]  SAR  =  structure  activity  relationship. 
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the  solid-phase  synthesis  to  accommodate  different  reaction 
rates,117’1  whereas  the  simpler  method  of  mixture  synthesis 
with  limiting  attachment-group  stoichiometry  may  be  used  in 
solution  to  ensure  all  library  members  are  generated  in  equal 
amounts.  The  cases  where  mixture  synthesis  has  been 
conducted  on  the  solid-phase  enlists  an  excess  of  the  reacting 
attachment  groups  in  adjusted  concentrations  to  accommo¬ 
date  the  different  reaction  rates  and  requires  that  this  rate 
information  be  available  at  the  onset  of  the  mixture  synthesis. 
In  contrast,  the  implementation  of  solution-phase  mixture 
synthesis  only  requires  the  ability  to  remove  residual  sub¬ 
strate.  Although  this  is  not  possible  with  solid-phase  synthesis, 
it  is  readily  accomplished  in  our  efforts  by  simple  acid-base 
liquid-liquid  or  liquid-solid  extraction. 

Analogous  to  encoding  strategies  for  solid-phase  split- 
and-mix  synthesis  of  large  libraries,  the  two  important  and 
complementary  techniques  of  positional  scanning1'7’1  and 
deletion  synthesis164,651  permit  the  immediate  identification 
of  active  leads  from  large  mixture  libraries  (Figure  2). 
Because  the  preparation  of  positional-scanning  and  dele¬ 
tion-synthesis  libraries  entails  implementing  mixture  synthe¬ 
sis,  they  are  not  easily  adapted  to  solid-phase  techniques. 
Typically,  global  observations  are  detected  with  a  positional- 
scanning  library  and  useful  lead  structures  (although  not 
necessarily  the  most  potent  member)179,86,1751  with  defined 
properties  are  identified.  Deconvolution  with  positional- 
scanning  libraries  is  more  sensitive  because  an  increase  in 
activity  is  measured,  and  this  allows  the  identification  of  less- 
active  lead  structures.  Deletion  synthesis,  on  the  other  hand, 
screens  for  a  loss  of  activity,  provides  less  global  information, 
but  is  better  at  identifying  a  uniquely  potent  library 
member.164,65,1741  The  combination  is  more  powerful  than 
either  technique  alone.1641  Importantly,  the  lead  identification 
is  deduced  from  a  single  round  of  screening.  Thus,  not  only 
are  positional-scanning  or  deletion-synthesis  libraries  much 
less  time  intensive  to  prepare  than  the  parallel  synthesis  of 
individual  compounds  or  small  mixtures  and  technically  less 
demanding  than  spacially  arrayed  or  encoded  split-and-mix 
library  synthesis,  but  they  produce  depository  libraries  for  use 
in  multiple  screens  capable  of  immediate  deconvolution  in  a 
single  round  of  testing.  Although  a  subtle  distinction  and  not 
typically  a  consideration,  the  convergent  synthesis  of  dimeric 
libraries  providing  symmetrical  and  unsymmetrical  combina¬ 
tions  cannot  be  carried  out  with  positional  scanning  libraries, 
but  must  use  deletion  synthesis  to  identify  active  unsym¬ 
metrical  dimers.1641  Relative  to  the  screening  of  individual 
compounds  or  small  mixtures,  the  more  subtle  structure-ac¬ 
tivity  relationship  information  contained  in  a  library  is  often 
missed.  However,  the  disadvantages  associated  with  the  loss 
of  this  information  is  balanced  against  the  ease  of  synthesis  of 
the  parent  libraries  and  must  be  judged  in  light  of  the 
screening  objectives  (lead  identification  or  lead  optimiza¬ 
tion),  and  the  nature  of  the  target. 

A  number  of  analytical  techniques  have  also  emerged  as 
useful  approaches  to  lead  identification  from  complex 
mixtures,  and  these  include  assay  (detection)  and  identifica¬ 
tion  by  mass  spectrometry,1'761  NMR  spectroscopy,1177,1781  and 
capillary  electrophoresis,1'791  and  have  expanded  the  utility 
and  dynamic  range  for  screening  such  mixture  libraries. 
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Figure  2.  Examples  of  positional-scanning  and  deletion-synthesis 
deconvolution  techniques  for  a  library  that  contains  four  variable  posi¬ 
tions  with  four  substitutions  at  each  position.  Positional  scanning:  In 
each  sublibrary,  an  individual  component  is  placed  in  one  position  but 
mixtures  are  used  in  all  other  positions.  The  most  active  sublibrary  in 
each  set  (*)  indicates  the  optimal  substituent  for  that  position.  Dele¬ 
tion  synthesis:  In  each  sublibrary,  an  individual  component  is  missing 
in  one  position  (e.g.  dA2  signifies  all  A  substituents  except  A2),  but  full 
mixtures  are  used  in  all  other  positions.  The  least  active  sublibrary  in 
each  set  (*)  indicates  the  optimal  substituent  for  that  position,  that  is, 
the  missing  component  is  the  most  active. 


3.  Solution-Phase  Library  Preparation:  Cyclic 
Anhydride  Templates 

A  summary  of  all  the  work  on  solution-phase  libraries  is 
beyond  the  scope  of  this  article  and  excellent  coverage  is 
provided  in  a  number  of  recent  reviews.117"3'1  However,  we 
would  like  to  briefly  describe  the  basis  of  our  own  library 
preparations  highlighting  the  design  elements  considered  in 
the  context  of  the  biological  targets  we  addressed.  Template  5 
(Scheme 3),  the  first  of  a  representative  set  of  five-  and  six- 
membered  cyclic  anhydride-based  templates  that  have  been 
examined,  consists  of  a  densely  functionalized  core  which 
imposes  little  structural  or  conformational  bias  that  might 


Ai 

Bh 

Cm 

Dm 

*  A2 

Bm 

Cl-4 

Dm 

A3 

Bm 

Cm 

Dm 

A4 

C1-4 

Dm 

*Am 

Bm 

Ci 

Dm 

Am 

Bm 

c2 

Dm 

Am 

Bm 

cs 

0,-4 

Am 

Bm 

C4 

Dm 

Am 

Bi 

Cm 

Dm 

Am 

B? 

C1-4 

Dm 

Am 

Bs 

Cl-4 

Dm 

*Am 

b4 

Cm 

Dm 

*  Am 

bm 

Cm 

D, 

Am 

Bm 

Cm 

d2 

Am 

Bm 

Cm 

d3 

Am 

bm 

cm 

d4 

dA-| 

Bm 

C1-4 

Dm 

*dA2 

Bm 

Cm 

Dm 

dA3 

Bm 

C1-4 

Dm 

dA4 

Bm 

Ci -4 

Dm 

‘Am 

Bm 

dC, 

Dm 

Am 

Bm 

dC2 

Dm 

Am 

Bm 

dC3 

Dm 

Am 

Bm 

dC4 

Dm 

am 

dB1 

Cm 

Dm 

Am 

dB2 

Cm 

Dm 

Am 

dB3 

Cm 

Dm 

*  am 

dB4 

Cm 

Dm 

*  At. 4 

Bm 

Cm 

dD, 

Am 

Bm 

C1-4 

dD2 

Am 

Bm 

Cl-4 

dD3 

Am 

Bm 

cm 

dD4 

Angew.  Chem.  Int.  Ed.  2003,  42,  4138-4176 


www.angewandte.org 


©  2003  Wiley-VCH  Verlag  GmbH  &  Co.  KGaA,  Weinheim  4^43 


Reviews 


D.  L.  Boger  et  al. 


h:0 

Boc-N  O 


R’Nu 


XR1 

r~i  r!nu 

Boc-N  O  - 

'' — COzH 

15 


/ — COXR1 
Boc-N  ,0 

M! 

16  XR2 


RLi,  RMgBr,  RZn 
RNH2,  R2NH,  ROH 


RNH2i  R2NH,  ROH 


B0C"N  ]|  Q 

8  0 


Gly-X  mimetic 


Gly-X  mimetic 


12 


Giy-Asp  mimetic 


Giy-Asp  (GD) 


HCI/dioxane 


R2  / — COXR1 

"  X-N  \ 

-COXR2*  j 

'  q  )  ' 

any  acylating  *  any 
agent  nucleophile 


6' 


Scheme  4.  Three-step  solution-phase  synthesis  of  iminodiacetic  acid 
chemical  libraries.  Compound  15  is  purified  by  washing  with  acid,  16 
and  9  by  washing  with  acid  and  base.  (The  R  groups  of  the  reagents  in 
boxes  become  R',  R2,  R2.) 
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Scheme  j.  Representative  templates. 


limit  its  use.157,581  Interesting  examples  of  alternative  templates 
include  the  dipeptidomimetic  template  8,  which  contains  a 
rigid  bicyclic  core  with  a  plane  of  symmetry  that  allows  it  to 
function  as  a  Giy-Asp  or  more  general  Gly-X  mimetic.157-5'31 
Its  symmetrical  structure,  like  that  of  5,  contains  three 
positions  that  can  be  sequentially  functionalized  enabling 
the  synthesis  of  libraries  with  up  to  three  variable  groups,  A 
more  rigid  variant  of  a  Gly-X  mimetic  14,  isoindoline-5,6- 
dicarboxylic  acid  (11),  has  also  been  examined.1'191  Impor¬ 
tantly,  an  array  of  such  templates,  such  as  5-7  (Scheme  3), 
displays  the  introduced  diversity  in  conformationally  con¬ 
strained  or  different  spacially  defined  locations  providing  rich 
structure-activity  relationship  information  on  the  target  of 
interest  in  a  single  round  of  testing. 

The  anhydride  template  is  activated  for  the  first  function¬ 
alization,  which  is  conducted  by  nucleophilic  addition,  and 
generates  a  free  carboxylic  acid  group  as  its  second  function¬ 
alization  site  (Scheme  4).  Although  a  subtle  design  element, 
this  release  of  the  second  functionalization  site  concurrent 
with  the  first  diversification  not  only  provides  an  isolation  and 
purification  handle,  but  it  avoids  a  separate  deprotection  step 
for  the  subsequent  diversification.  In  addition,  the  use  of 
symmetrical  anhydrides  insures  only  a  single  regioisomer  of 
the  product  is  generated  simplifying  the  potential  complexity 
of  the  library.  The  final  diversification  entails  deprotection  of 
the  secondary  amine  which  unmasks  a  third  coupling  position. 
At  each  step,  the  released  functionality  aids  not  only  the 
isolation,  but  provides  a  method  for  separation  of  each  of  the 
intermediates  and  final  products  from  starting  materials, 
reactants,  reagents,  and  their  byproducts  by  simple  liquid- 


liquid  or  liquid-solid  acid-base  extraction  providing  pure 
materials  (typically  >95%)  independent  of  reaction  effi¬ 
ciencies,  and  a  number  of  classes  of  nucleophiles  and  electro¬ 
philes  can  be  used  to  functionalize  the  template 
(Scheme  4).17'1 

This  approach  has  been  used  to  provide  libraries  in  a 
variety  of  formats  (individual  compounds,  small  mixtures, 
large  mixtures,  positional-scanning,  deletion-synthesis  libra¬ 
ries)  applicable  to  both  lead  generation  or  lead  optimization. 
In  fact,  it  was  in  this  context  that  deletion-synthesis  libra¬ 
ries1^  were  introduced  and  compared  with  other  library 
formats  including  positional  scanning.1'’5’79’861  Although  this 
work  is  most  often  cited  for  its  isolation  and  purification  by 
liquid-liquid  acid-base  extraction,  the  initial  disclo¬ 
sures157-59,681  detailed  one  of  the  first  applications  of  poly¬ 
mer-bound  immobilized  reagents  in  solution-phase  library 
synthesis  (polymer  supported  EDCI  (PS-EDCI),  EDCI  =  1- 
ethyl-3-(3-dimethylaminopropyl)carbodiimide)  and  also 
described  the  first  use  of  ion-exchange  resins  in  liquid-solid 
“extraction”  workups.1581  The  approach  has  been  shown  to 
dependably  deliver  pure  compounds  in  5-150  mg  quantities 
(>95%  pure)  and  multiple  libraries  have  been  assembled 
providing  over  40000  compounds  to  date.  It  avoids  the 
disadvantages  of  solid-supported  synthesis,  including  the 
more  restrictive  scale,  the  required  functionalized  substrates 
and  supports,  compatible  linkers,  and  the  requirements  for 
orthogonal  attachment  and  detachment  chemistry.  It  does  not 
require  specialized  methods  for  monitoring  each  step  of 
multistep  syntheses,  allows  the  purification  of  intermediates, 
and  provides  the  final  pure  products  directly  for  use  in 
binding  or  functional  assays.  Extensions  of  these  studies  for 
generating  symmetrical  or  unsymmetrical  chemical  libraries 
suitable  for  probing  receptor  and  protein  homo-  and  hetero¬ 
dimerization  events  were  described  (Scheme  5).  The  dimeri¬ 
zation  linkage  was  accomplished  not  only  through  amide- 
bond  formation  with  dicarboxylic  acids,  but  also  by  Stille 
coupling,1631  by  the  first  application  of  an  intermolecular 
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Scheme  s-  Homo-  or  heterodimerization  of  libraries  for  protein-protein 
dimerization.  The  deconvolution  of  the  unsymmetrical  dimerization 
products  is  by  scanning  or  deletion  synthesis.  Cy  =  cyclohexyl, 

PyBrOP  =  bromotris(pyrrolidino)phosphonium  hexafluorophosphate. 


olefin-metathesis  reaction  in  the  generation  of  chemical 
librariesj60,6'-641  and  by  one  of  the  first  disclosed  applications 
of  solid-supported  catalysts  (Pd-C)  in  solution-phase  library 
preparation  that  was  used  to  promote  a  biaryl  coupling  (see 
Scheme  5). 1621  In  this  context,  the  use  of  size  exclusion 
chromatography  (filtration)  for  the  rapid  purification  of 
such  dimer,  or  higher  oligomer,  libraries  was  also  demon¬ 
strated.1621 

Thus,  the  preparation  and  dimerization  linkage  of  imino¬ 
diacetic  acid  diamides  can  be  conducted  in  a  reaction 
sequence  that  requires  only  three  steps.  In  addition  to  the 
multiplication  of  the  diversity  that  arises  through  the  combi¬ 
natorial  dimerization  linkage  of  the  iminodiacetic  acid 
diamides,  the  solution-phase  synthesis  of  the  intermediates 
permits  their  direct  linkage  which  would  be  precluded  by 
solid-phase  synthesis  techniques.  As  such,  the  strategy  is 
uniquely  suited  for  taking  advantage  of  such  dimerization 
(convergent)  strategies  utilizing  a  limited  number  of  synthetic 
steps.  This  modular  approach  to  the  generation  of  libraries  is 
especially  well-suited  for  the  discovery  of  antagonists  or 
agonists  of  receptor  and  protein  homo-  and  heterodimeriza¬ 
tion.  Simple  binders  (i.e.,  16-18,  Scheme  5)  can  serve  as 
antagonists  of  ligand-induced  receptor  or  protein  dimeriza¬ 
tion.  Covalently  linked  symmetrical  dimers  (i.e.,  19-21)  can 
be  utilized  to  promote  receptor  or  protein  homodimerization 
whereas  unsymmetrical  dimers  can  be  utilized  to  promote 
receptor  or  protein  heterodimerization.  Thus,  both  antago¬ 
nists  and  agonists  may  be  developed  depending  on  the 
therapeutic  application  and  this  was  a  major  factor  in  our 
adaptation  of  this  approach.  Notably,  the  size  and  character¬ 
istics  (cell  permeability)  of  compounds  targeting  intracellular 


macromolecules  may  place  restrictions  on  the  library  compo¬ 
sition,  whereas  libraries  of  compounds  addressing  extracel¬ 
lular  targets  including  cell  surface  receptors  and  proteins  may 
not  be  influenced  by  such  limitations. 

One  might  arguably  suggest  from  an  examination  of  the 
anhydride  templates  that  the  structural  complexity,  available 
diversity,  and  drug-like  properties  of  the  library  members,  or 
scope  of  the  technology  may  be  restricted  producing  libraries 
of  limited  value.  However,  the  present  40000-member  library 
has  been  remarkably  siccessful  in  providing  leads  for  essen¬ 
tially  any  target  it  has  been  screened  against.  Counter  to  the 
prevailing  trend  of  preparing  libraries  of  increasing  molecular 
rigidity  and  complexity,  it  may  well  be  that  this  success  can  be 
attributed  to  the  library's  appropriate  “intermediate”  com¬ 
plexity  where  each  library  member  embodies  multiple 
pharmacophores  more  suited  for  lead  discovery.11801  It 
should  also  be  emphasized  that  alternatives  to  the  amide 
couplings  even  for  the  initial  library  generations  have  been 
disclosed.17'1  In  addition  and  although  this  is  discussed  in 
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{+J-CC-1065 


Triostin  A 


Distamycin  A 


Scheme  6.  Natural  product  templates  where  analogues  were  synthe¬ 
sized  using  the  solution-phase  combinatorial  approach. 
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greater  detail  in  Sections  5  and  6, 
complex  natural  product  libraries 
(Scheme  6)  have  been  assembled 
using  this  technology  indicating 
that  the  scope  extends  well 
beyond  what  one  might  initially 
suspect. 


4.  Targeting  Protein-Protein 
Interactions 

As  attractive  as  the  potential  is 
for  modulating  aberrant  cellular 
signaling1'81''901  by  inhibiting,  pro¬ 
moting,  or  mimicking  protein-pro- 
tein1'00, 1821  or  protein-DNA 
interactions,  the  general  consensus 
has  been  that  such  targets  repre¬ 
sent  stubborn  problems  for  small- 
molecule  therapeutic  intervention. 
The  challenges  posed  by  such  tar¬ 
gets  arise  from  the  large  surface 
area  generally  covered  by  the  two 
interacting  proteins  (approxi¬ 
mately  800  A2  per  protein  on  aver¬ 
age),  the  relatively  flat  interface 
between  binding  proteins,  and  the 
range  of  affinities  between  proteins 
depending  on  their  physiological 
function.1""1  Although  there  are 
many  examples  of  antibodies,  dom¬ 
inant  negative  proteins,  and 
medium-sized  peptides  that  inhibit 
protein-protein  interactions,  the 
disclosure  of  small  molecules  has 
not  been  as  extensive.  Because  the 
nature  of  protein-protein  interac¬ 
tions  differ  widely  (hydrophobic 
versus  hydrophilic,  the  surfaces 
can  be  relatively  flat  and/or  have 
buried  crevices),  since  the  affinities 
span  a  considerable  range  (KA  = 
10~6-10'12m),  and  the  response  to 
inhibition  may  vary  considerably 
(the  need  to  simply  perturb  the 
equilibrium  versus  need  for  com¬ 
plete  inhibition),  it  is  fair  to  assume 
that  some  protein-protein  interac¬ 
tions  may  prove  to  be  better  targets 
than  others  for  small-molecule 
intervention.  Table  31'91'2251  sum¬ 
marizes  many  of  the  recent  prom¬ 
ising  results  of  investigations  on  a 
range  of  targets  and  Scheme  7 
shows  the  structures  of  some  of 
the  small-molecule  modulators.  It 
is  clear  that  a  reevaluation  of  the 
prevailing  bias  against  such  pro- 


Tab\ey.  Small  molecule  modulators  of  protein-protein  interactions. 


Protein-protein 

Interaction 

Compound  Source 

Screen 

Lead  Structure"1 

Activity 

Ref. 

Ribonucleotide 

Reductase 

(dimerization) 

Peptide  analogue 

Competitive 
binding  assay 

BILD  1263 

Antagonist 

IC50  =  0.3  nM 

[197] 

iNOS  (dimeriza¬ 
tion) 

Encoded  solid- 
phase  library 

Functional 

assay 

iNOS  inhibitor 

IC50  =  0.6  nM 

[198] 

HIV  Protease 
(dimerization) 

Designed  inhibitor 

Enzyme  inhibi¬ 
tion 

HIV-1  protease 
inhibitor 

Ki  =  310nM 

[199a] 

HIV  Protease 
(dimerization) 

Natural  product 

Enzyme  inhibi¬ 
tion 

Didemnaketal  A 

IC50  =  2pM 

[199b] 

HIV-1  Integrase 
(dimerization) 

Interface  peptides 

Enzyme  inhibi¬ 
tion 

*“ 

i  C50  —  2—4  JIM 

[200] 

CaM/smMLCK 

rt-helix  mimetic 

Enzyme  inhibi¬ 
tion 

CaM  antagonist 

ICS0  =  9  nM 

[201] 

MMP-2/avf?3 

Solution  phase 
library 

Radiolabeled 
binding  assay 

27  (Figure  11) 

Angiogenesis 

inhibitor 

[77,78] 

Paxillin/a4 

Positional  scan¬ 
ning  library 

ELISA 

A7B7C7 

(Figure  13) 

Antagonist 

ICSO  =  300  nM 

[79] 

Bax/Bcl-xL 

16320  compound 
library 

Fluorescence 

polarization 

assay 

BH3I-1 

BH3I-1' 

BH3I-2 

Antagonist 

ICS0  =  8- 

16  pM 

[202] 

Bax/Bcl-xL 

Natural  product 

Functional 

assay 

Antimycin 

Antagonist 

[203] 

Bax/Bd-xL 

Virtual  screening 
of  193  383  com¬ 
pounds 

Fluorescence 

polarization 

assay 

HA14-1 

Antagonist 

IC50  =  9  JIM 

[204] 

Bax/Bd-xL 

Designed  (x-helix 
mimetic 

Fluorescence 

polarization 

assay 

Bax  antagonist-1 

kD  — 11 4  nM 

[205  a] 

Bax/Bcl-xL 

Designed  a-helix 
mimetic 

Fluorescence 

polarization 

assay 

Bax  antagonist-2 

Kj  =  1 .6  pM 

[205b] 

MDM2/p53 

Natural  product 

ELISA 

Chlorofusin 

Antagonist 

IC50  =  4.6  pM 

[206a] 

MDM2/p53 

Unknown 

ELISA,  NMR 

Chalcone  B 
Antagonist 

I  l 

•sr  0 

II  ^ 
0  II 

[206b] 

HDM2/p53 

Designed  inhibitor 

ELISA 

HDM2  Antago¬ 
nist 

IC50  =  5  nM 

[207] 

Myc/Max 

Solution  phase 
library 

FRET 

28  (Figure  17) 

Antagonist 

IC50  =  20  pM 

[80] 

LEF-l/P-catenin 

Solution  phase 
library 

Reporter  assay 

A1B2C8 
(Figure  18) 

Antagonist 

[71] 

NGF/p75 

NGF/TrkA 

Commercial  libra¬ 
ries 

Radiolabeled 
binding  assay 

ALE-0540 

IC50  — 6  pM 

ICso  =  4pM 

[208] 

HIV-1 

(RANTES)/CCR5 

Corporate  library 

Radiolabeled 
binding  assay 
(RANTES) 

TAK-779 

Antagonist 

1 C50  =  1.4  nM 

[209] 

IL-2/IL-2R 

Designed  inhibitor 

Radiolabeled 
binding  assay 

Ro26-4550 

Antagonist 

IC50  =  3  JIM 

[210a] 

IL-2/IL-2R 

Structure-based 
design  -f  frag¬ 
ment  assembly 

Competitive 

binding 

IL-2  Antagonist 

IC50  =  60  nM, 

Xd  =  100  nM 

[210b] 

IL-1/IL-1R 

Designed  inhibitor 

Scintillation 
proximity  assay 

IL-1  Antagonist 

IC50<  10  pM 

[211] 

EPO/EPOR 

Phage  display 

Radiolabeled 
binding  assay 

EMP1 
(Figure  20) 

EPO  Agonist 
ICS0  =  0.2  pM 

[319-321] 

EPO/EPOR 

Corporate  library 

Radiolabeled 
binding  assay 

EPO  antagonist 

ICSO  =  60  pM 

[330] 

Modified  to 

Radiolabeled 
binding  assay 

EPO  agonist 

EPO  Agonist 
ICS0  =  4.4  pM 

[330] 

EPO/EPOR 

Solution  phase 
library 

Radiolabeled 
binding  assay 

30  (Figure  19) 

EPO  Agonist 
EC50  =  8  jim 

[82] 

TPO/TPOR 

Phage  display 

ELISA 

TPO  Agonist 

ECso  =  400  nM 

[212] 

Modified  to 

ELISA 

AF13948  (dimer) 

ECso  =  100  pM 

[212] 
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Table  3:  (Continued) 

Protein-protein 

Interaction 

Compound  Source 

Screen 

Lead  Structure**1 

Activity 

Ref. 

TPO/TPOR 

Corporate  library 

Radiolabeled 
binding  assay 

TM41 

TPO  Agonist 

[213] 

GCSF/GCSFR 

Corporate  library 

Functional 

assay 

SB247464 

Agonist 

[214] 

C5a/C5aR 

Design  inhibitor 

Radiolabeled 
binding  assay 

C5a  antagonist 

IC50  =  300  nM 

pi  si 

FKBP/Cn 

Natural  product 

- 

FK506 

Cn  inhibitor 

[216] 

Cyclophilin/Cn 

Natural  product 

- 

Cyclosporin  A 

Cn  inhibitor 

[216217] 

FKBP/FRAP 

Natural  product 

— 

Rapamycin 

Immunosup¬ 

pressant 

[218] 

Chemical  induc¬ 
ers  of  dimeriza¬ 
tion  (CIDs) 

Synthetic  con¬ 
structs 

Various 

Representative 

CID 

Various 

[192-196] 

Ribosome 

(RNA)/protein 

Natural  products 

Various 

Macrolides 

Tetracycline 

Oxazolidinones 

Antibiotics 

[219] 

Angewandte 

Chemie 


Streptogramin 

Chloramphenicol 

TAN  1057 

Pestalone 

Cycloheximide 

Hygromycin 

Blasticidin 

Virginamycin 

Sparsomycin 

Puromycin 

Clindamycin 


Hsp90/Client 

Protein 

Natural  products 

Various 

Geldanamycin 

Herbimycin 

Radidicol 

17-AAG 

Antitumor 

compounds 

[220a] 

Hsp90/Client 

Protein 

Designed  ligand 

Competitive 

binding 

PU3 

ECS0  =  15- 

20  p.M 

[220b] 

a/p  tubulin 

Natural  products 

Various 

Colchicines 

Vincristine 

Taxol 

Epothilones 

Mitotic  inhib¬ 
itors 

[221a] 

oc/P  tubulin 

Commercial 

Library 

Cell-based 

screen 

Synstab  A 

Mitotic  Inhib¬ 
itors 

[221b] 

Topoisomerase/ 

DNA 

Natural  product 

Various 

Camptothecin 

Antitumor 

compounds 

[222] 

PG/Transpepti- 

dase 

Natural  product 

Various 

Vancomycin 

Antibiotic 

[223a] 

LI  ll/Transglyco- 
sylase 

Natural  product 

Various 

Ramoplanin 

Antibiotic 

[,223b] 

Nuclear  hor¬ 
mone  receptors 

Various 

Various 

Various 

Agonists  and 
antagonists 

[224] 

RGD  mimetics 

Various 

Various 

Various 

Antagonists 

[240-252] 

SH2  domains 

- 

- 

- 

- 

[181,225a] 

SH3  domains 

- 

- 

- 

- 

[225b] 

WW  domains 

- 

- 

- 

- 

[225c] 

PDZ  domains 

- 

- 

- 

- 

[225d] 

[a]  Lead  structure  is  shown  in  Scheme  7  unless  indicated  otherwise. 

tein-protein  interaction  targets  is  warrented.|loo"'n2-l7U8MS21 
Interestingly,  fewer  of  these  examples  represent  results 
derived  from  screening  combinatorial  libraries  than  might 
be  expected  and  most  that  do  relied  on  phage  display  to 
generate  initial  peptide  leads.  The  examples  in  Table  3 
represent  a  wide  range  of  strategies  despite  the  commonality 
of  all  in  perturbing  a  protein-protein  interaction.  For 
example,  those  that  entail  the  inhibition  of  enzyme  action 


do  so  by  a  range  of  mechanisms 
including  directly  preventing  their 
homodimerization  (HIV  protease) 
or  heterodimerization  (ribonucleo¬ 
tide  reductase),  or  inducing  a  con¬ 
formational  change  that  precludes 
dimerization  (iNOS),  effecting 
their  localization  (MMP-2,  CaM), 
access  to  the  substrate  (transpepti¬ 
dase,  transglycosylase),  stabiliza¬ 
tion  of  an  intermediate  ternary 
complex  (topoisomerase),  or  pro¬ 
moting  inhibitory  ternary  com¬ 
plexes  with  additional  proteins 
(calcineurin  (Cn)).  Not  included 
in  this  summary  are  enzyme 
active-site  inhibitors  that  block  a 
peptide  or  protein  substrate  bind¬ 
ing  to  the  enzyme,  small-molecule 
agonists  and  antagonists  of  G-pro- 
tein-coupled  receptors,  and  all  the 
cell  surface  integrin  inhibitors 
based  on  the  RGD  recognition 
motif.  The  G-protein-coupled 
receptor  antagonists  typically  bind 
the  receptor  transmembrane 
domain  and  do  not  directly  (per¬ 
haps  allosterically)  perturb  a 
receptor-ligand  interaction.  In 
contrast,  the  cell  surface  integrin 
inhibitors  have  been  the  subject  of 
several  reviews1240"2521  and  consti¬ 
tute  a  beautiful  example  of  the 
discovery  of  a  simple,  small,  and 
general  protein-protein  interac¬ 
tion  motif  that  lends  itself  to  sub¬ 
sequent  antagonist  development. 
Similarly,  Src-homology  2  domains 
(SH2),  SH3,  WW,  and  PDZ 
domains  constitute  common  pro¬ 
tein-protein  binding  sites  with 
small,  focused  interfaces  for  pro¬ 
tein  localization  or  signal  trans¬ 
duction  that  have  been  the  subject 
of  small-molecule  screening  or 
design  efforts.(lnu8,-22S|  In  addition, 
there  are  several  historically 
important  modulators  of  protein- 
protein  interactions  that  have 
defined  in  such  discussions  impor¬ 
tant  therapeutic  targets.  Although 
these  are  often  neglected,  they  are  superb  examples  of 
achieving  productive  therapeutic  endpoints  by  interfering 
with  a  protein-protein  (or  protein-DNA)  interaction.  Nota¬ 
bly,  there  may  be  significant  distinctions  in  the  work,  design  of 
the  library,  screening  methods,  and  challenge  of  the  inves¬ 
tigations  depending  on  whether  it  is  an  antagonist  or  agonist 
of  a  protein-protein  interaction  that  is  sought.  To  date,  the 
consensus  is  that  the  discovery  of  antagonists  may  be  much 
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Scheme  7.  Representative  small-molecule  modulators  of  protein-protein  interactions  listed  in  Table  3. 
©  2003  Wiley-VCH  Verlag  GmbH  &  Co.  KGaA,  Weinheim  WWW.angewandte.org 


Angew.  Chem.  Int.  Ed.  2003,  42,  4138-4176 


Combinatorial  Chemistry 


Angewandte 

Chemie 


easier  than  an  agonist.173,170-1711  As  such,  the  efforts  related  to 
the  discovery  of  EPO.  TPO.  and  GCSF  agonists  (mimetics; 
Table  3)  constitute  remarkable  achievements.  Finally,  the 
development  of  general  methods  for  establishing  the  selec¬ 
tivity  of  such  antagonists  or  agonists  is  a  challenge  that  still 
has  not  been  faced  by  the  field  and  that  may  constitute  a 
major  reason  for  the  reluctance  to  address  protein-protein 
interactions  as  useful  therapeutic  targets. 

Several  very  clever  ways  of  enlisting  chemically  prepared 
bifunctional  protein  binders  that  induce  dimerization  to 
achieve  a  biological  endpoint  or  objective  have  been 
detailed.11911  These  chemical  inducers  of  protein  dimerization 
(CIDs)  pioneered  by  Schreiber  and  Crabtree  et  al.  have  been 
used  to  modulate  cell  membrane  receptor  signaling, 11921  to 
control  gene  expression,11920931  to  selectively  antagonize 
cellular  processes,11941  to  identify  protein-ligand  partners,11951 
and  even  to  target  protein  heterodimers  with  no  established 
ligand.1'961 

Cell  growth,  differentiation,  migration,  and  apoptosis  are 
regulated  in  part  by  growth  factors  or  cytokines.  These  factors 
are  unable  to  penetrate  the  cell  membrane  and  exert  their 
effects  by  binding  to  cell  surface  receptors.  In  many  instances, 
such  receptors  are  activated  by  ligand-induced  dimerization 
or  oligomerization.1183"11161  In  addition,  several  components  of 
the  intracellular  signal-transduction  pathways  are  also  regu¬ 
lated  by  dimerization.  For  instance,  certain  cytoplasmic 
signal-transduction  molecules  dimerize  after  activation,  and 
the  active  form  of  a  transcription  factor  is  often  a  dimer.1187"™1 
Thus,  protein  dimerization  has  emerged  as  a  general  mech¬ 
anism  for  the  initiation  and  downstream  regulation  of  signal 
transduction.  The  targets  listed  below  that  we  selected  to 
pursue  were  chosen  not  only  for  their  therapeutic  importance, 
but  also  because  each  constitutes  a  distinct  stage  at  which  to 
modulate  cellular  signaling  by  controlling  such  protein-pro¬ 
tein  or  protein-DNA  interactions: 

•  Promote  a  cell  surface  receptor  homodimerization  that  is 
required  for  activation  (EPO  agonist). 

•  Inhibit  an  extracellular  cell  surface  integrin-protein 
interaction  (MMP-2/a,p3). 

•  Inhibit  an  intracellular  cell  surface  integrin-protein  inter¬ 
action  (Paxillin/a4fS,). 

•  Inhibit  the  intracellular  heterodimerization  of  a  tran¬ 
scription  factor  (Myc/Max,  LEF-l/fi-catenin). 

•  Inhibit  the  protein-DNA  interaction  of  a  transcription 
factor  (LEF-l/p-catenin,  androgen  receptor). 

The  more  personal  account  of  our  own  studies  on  this  series  of 
targets  that  follows  highlights  some  important  aspects  of 
target  selection,  screening  technology,  and  library  generation. 


4.1.  MMP-2/aJ, 


MMP-2  (gelatinase  A),  a  member  of  the  matrix  metal¬ 
loproteinase  family,  is  secreted  by  vascular  endothelial  cells 
and  plays  a  crucial  role  in  degrading  the  extracellular  matrix 
during  tumor-induced  angiogenesis.12261  Consequently,  MMP- 
2  inhibitors  have  been  extensively  pursued  and  developed 
which  diminish  tumor  growth  and  metastasis.1226"2311  Another 
key  participant  in  angiogenesis  is  the  integrin12321  avp3,  which 


A) 


ceil 

nssrr  brans 


if; 

h-E  bindirs  /si 

catalytic. 

\  Tpx  17) 

^angiogenesis  J 

integrin  j 

«vib  I 

Jj 

/“N 

\ 

v'  ■  Ir  ; 

Jocattoston  *  |:-J'  j 

ant*  aeiivatkw  I 

of  MMP-2  on 

Ms 

L 

trix 

integrin  surface  | 

S" 

*"  degradation 

u 

j 

V  J 

U 

V,,,.v 

Radiolabeled  binding 
assay  with  immobilized 

"A  , —  — 

r \r 


.rv'gp 


A 


1% 

M 


small  msteeule 
antagonist 


JL 


f rfiSif 


sruptior  of  ’’j 

■angiogenesis  J 


B)  Library  of  600  compounds  provided: 

{8  subunit  most  critical  to  activity) 


Library  of  77  analogues  with  improved  lead  structure 
(B  subunrt  features  contributing  to  activity  identified) 


Substructure 

scanning 


Simplified  lead:  better  solubility  and  in  vivo  antitumor  activity; 
target  validation 


Figure 3.  A)  Disruption  of  angiogenesis  by  blocking  matrix  metallo  pro¬ 
teinase  2  (MMP-2)  binding  to  integrin  aJSj.  B)  Developing  the  angio¬ 
genesis  inhibitor  27. 


mediates  cellular  interactions  with  the  extracellular 
matrix.1231  233"2391  This  heterodimeric  cell  surface  receptor 
recognizes  certain  matrix  proteins  containing  the  RGD 
(Arg-Gly-Asp)  peptide  sequence,  which  leads  to  integrin 
clustering,  and  ultimately  supporting  the  migration  and 
survival  of  endothelial  cells.  Soluble  RGD  peptides,  non¬ 
peptide  RGD  mimetics,  and  antibodies  that  disrupt  a,fS3 
ligation  have  been  shown  to  inhibit  angiogenesis  and  tumor 
growth.1249"2521  Conservative  estimates  indicate  that  at  least 
40  pharmaceutical  firms  are  developing  angiogenesis  inhib¬ 
itors  directed  at  these  two  targets  with  at  least  27  drugs 
currently  in  clinical  trials.1231-253"2581 

Recently,  a  novel  interaction  between  MMP-2  and 
integrin  avf53  was  observed  by  Cheresh  et  al.  when  these 
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proteins  were  found  to  colocalize  on  the  surface  of  angiogenic 
blood  vessels  in  vivo.12591  Integrin  avf>,  binds  MMP-2,  and  this 
interaction  is  a  requisite  step  in  the  cellular  utilization  of  the 
enzyme  on  the  surface  of  invasive  endothelial  cells.1260'  A 
noncatalytic,  193-residue  C-terminal  fragment  of  MMP-2, 
termed  PEX,  was  found  to  inhibit  MMP-2  binding  to  ocv(iS3  and 
to  indirectly  block  its  cell  surface  proteolytic  activity.126'1  PEX 
was  further  shown  to  disrupt  angiogenesis  and  tumor  growth 
in  the  chick  chorioallantoic  membrane  (CAM).  A  naturally 
occurring  form  of  PEX  can  be  detected  in  vivo  in  association 
with  cells  expressing  avp3,  which  implicates  the  role  of  PEX  as 
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Figure  4.  Screening  of  a  600-compound  library  that  provided  the  initial 
lead  A6B10C4  for  inhibition  of  MMP-2  binding  to 


an  endogenous  regulator  of  angiogenesis.  These  observations 
suggested  that  disrupting  the  extracellular  binding  of  MMP-2 
to  integrin  avp,  may  be  a  promising  approach  to  controlling 
angiogenesis,  and  could  provide  a  novel  therapeutic  target  for 
angiogenesis-dependent  diseases  including  cancer  (Figure  3). 

The  structural  details  of  the  MMP-2/ctvf53  interaction  are 
unknown,  although  it  has  been  established  that  the  integrin 
binding  site  is  distinct  from  that  which  recognizes  the  RGD 
sequence  in  traditional  high-affinity  ligands  (e.g.,  vitronec- 
tin).|26n|  In  the  absence  of  further  knowledge  of  the  target 
structure,  a  combinatorial  chemistry  approach  was  used  to 
identify  small-molecule  MMP-2/a.(l3  antagonists.  This 
approach  provided  the  first  members  of  a  new  class  of  anti- 
angiogenic  compounds  that  derive  their  activity  by  inhibiting 
this  protein-protein  interaction,  and  validated  this  extracel¬ 
lular  target  for  therapeutic  intervention.'77  ™' 

The  initial  lead  came  from  a  library  that  was  screened  as 
60  mixtures  of  10  compounds  (Figure  4).  The  most  effective 
inhibitors  contained  the  BIO  subunit  and  each  of  the 
individual  compounds  in  several  of  these  mixtures  was 
prepared  for  identification  of  the  active  constitutents. 
Because  the  10-compound  mixture  was  generated  in  the  last 
step  of  the  library  synthesis,  this  entailed  simply  repeating  the 
last  step  of  the  synthesis  with  individual  (instead  of  mixture) 
C  subunits  on  the  archived  samples  of  the  immediate 
precursor. 

Thus,  the  screening  of  the  compound  libraries  in  a 
radiolabel  binding  assay  using  the  immobilized  integrin  led 
to  the  initial  identification  of  A6B10C4,  its  refinement  to 
A6B19C4  and  its  subsequent  simplification  to  provide  27 
which  also  possesses  improved  solubility  and  in  vivo  activity 
(Figure  3).  The  inhibitor  27  was  shown  to  bind  av(J3  and 
disrupt  the  binding  of  MMP-2,  but  not  the  integrin  ligand 
vitronectin.  It  inhibits  collagen-matrix  degradation  without 
directly  inhibiting  MMP-2  enzymatic  activity,  was  shown  to  be 
a  potent  inhibitor  of  angiogenesis  in  a  chick  CAM  model,  and 
led  to  near  complete  reduction  in  the  growth  of  CS-1 
melanoma  tumors  in  vivo  in  a  chick  CAM  model.  Notably,  a 
close  analogue  of  27  (the  benzamide  rather  than  the 
carbamate)  was  inactive  and  used  as  a  control.'77,78' 


4.2.  Paxillin/a4:  A  Successful  Use  of  Positional  Scanning 

The  integrin  a-fi,  (also  know  as  VLA4,  Very  Late 
Antigen  4)  is  a  cell  surface  receptor  that  plays  an  important 
role  in  embryogenesis,  hematopoiesis,  and  the  immune 
response.'262-267'  It  binds  natural  ligands  including  vascular 
cell  adhesion  molecule  1  (VCAM-1)  and  an  alternatively 
spliced  connecting  segment  (CS-1)  from  the  extracellular 
matrix  protein  fibronectin.  It  mediates  cellular  adhesion  and 
activation  through  a  variety  of  cell-cell  and  cell-matrix 
interactions  that  regulates  leukocyte  migration  into  tissues 
during  inflammatory  responses  and  lymphocyte  traffick¬ 
ing.'262-267'  This  integrin  is  believed  to  regulate  cellular 
functions  differently  from  other  integrins  because  the  a4 
cytoplasmic  tail  binds  tightly  to  the  signaling  adapter  protein 
Paxillin  through  a  short  conserved  sequence  motif  dominated 
by  two  residues  (Glu983  and  Tyr991  ).'268'  The  cq/Paxillin 
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Figure  j.  Top:  Inhibition  of  cell  migration  by  blocking  intracellular  paxillin/a4  binding,  middle:  identification  of  the  lead  structure  A7B7C7  through 
high-throughput  screening  using  an  ELISA  assay  and  bottom:  description  of  the  high-throughput  screen  (ELISA).  VLA-4  =  very  late  antigen  4, 

Pax  =  paxillin,  HRP  =  horseradish  peroxidase 


interaction  leads  to  enhanced  rates  of  cell  migration  and 
reduced  rates  of  cell  spreading,  focal  adhesion,  and  stress 
fiber  formation  (Figure  5).|2f>9'270'  These  biological  responses 
to  integrin-mediated  cell  adhesion  contribute  to  leukocyte 
migration  and  changes  in  gene  expression  important  in 
chronic  inflammation.  While  extracellular  inhibitors  for  this 
class  of  integrins  are  under  development  for  the  treatment  of 
asthma  and  multiple  sclerosis,1271'2731  inhibitors  targeting  the 
unique  cytoplasmic  Paxillin/a4  interaction  had  not  been 
explored. 

The  target  protein-protein  interaction  again  represents 
one  in  which  no  structural  information  is  available  and  no 
existing  inhibitors  had  been  disclosed.  The  examination  of  our 
libraries  (ca.  40000  compounds)  in  an  ELISA  assay  enlisting 
the  immobilized  a4  cytoplasmic  tail  and  screening  for 
inhibition  of  Paxillin  binding  provided  the  first  and  potent 
inhibitors  of  this  protein-protein  interaction  (Figure  5).  The 
preparation  of  key  substructure  analogues  of  A7B7C7 
identified  structural  features  required  for  activity,  those 
available  for  further  modifications,  and  those  which  may  be 
removed  or  modified  without  significantly  changing  the 
activity.  The  most  potent  lead  structure  was  shown  to  inhibit 
cqfVmediated  human  JurkatTcell  migration  in  a  dose- 
dependent  manner  validating  the  intracellular  Paxillin/a4 
interaction  as  a  useful  and  unique  target  for  therapeutic 
intervention.  Notably,  such  target  validation  is  often  first 
established  by  biological  techniques  or  with  monoclonal 


antibodies  or  peptide  consensus  sequences  derived  from  the 
receptor  or  its  ligands.  Consequently,  it  is  of  note  that  even  the 
initial  target  validation  as  well  as  its  suitability  for  small- 
molecule  therapeutic  intervention  came  from  the  screening  of 
small-molecule  libraries.  Moreover,  the  lead  structure 
emerged  from  a  library  of  1000  compounds  that  had  been 
prepared  in  two  formats:  l)a  traditional  small-mixture 
format  composed  of  100  mixtures  of  10  compounds,  and  2)  a 
less  traditional  positional-scanning  library.  Parallel  testing  of 
both  libraries  provided  the  rare  opportunity  to  critically 
compare  the  results  derived  from  the  two  approaches.1791 

Thus,  an  important  feature  in  this  work  rested  with  the 
parallel  screening  of  a  positional-scanning  library  for  which 
every  component  contained  in  a  traditional  compound  library 
was  also  present,  but  assembled  such  that  immediate  decon¬ 
volution  was  possible.  Scanning  for  the  best  subunit  at 
position  A  by  screening  sublibrary  AxBC  showed  that  the 
greatest  inhibition  was  observed  for  mixture  A7BC,  identify¬ 
ing  the  CDPI  subunit  as  the  best  subunit  at  position  A 
(Figure  6).  Using  the  same  procedure,  CDPI  was  also 
identified  as  the  most  effective  subunit  at  positions  B  and  C 
(Figure  6).  Immediate  deconvolution  of  the  results  identified 
the  A7B7C7  combination  as  a  potent  Paxillin/a4  binding 
antagonist.  This  same  lead  compound  (A7B7C7)  was  identi¬ 
fied  from  the  simultaneous  and  more  traditional  library 
screening.  This  unbiased  screening  success  must  be  tempered 
by  the  fact  that  the  positional-scanning-library  screening  and 
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4.3.  Myc/Max 

The  c-myc  proto-oncogene  is  involved  in  the  progression 
of  a  wide-range  of  human  tumors.1274-2841  Proteins  in  the 
immediate  Myc  network  are  essential  regulators  of  cell 
growth  and  differentiation.  Activation  and  transformation 
occur  mainly  through  elevated  expression  of  its  gene  product 
Myc,  a  short-lived  nuclear  protein  and  an  important  member 
of  the  basic  helix-loop-helix  leucine  zipper  (bHLHLZ)  tran¬ 
scription  factors  (Figure  7).l28S  28f,l  The  oncogenic  potential  of 
c-myc  has  been  demonstrated  in  transgenic  animals,  recon¬ 
stituted  organs  or  cell  culture,  and  typically  requires  the 
action  of  at  least  one  additional  oncogene,  such  as  ras  or  bcl-2 
(inhibits  apoptosis).1274-2841  In  normal  cells,  Myc  is  required  for 
cell  proliferation  and  prevents  differentiation.  It  is  transiently 
expressed  in  response  to  mitogenic  stimuli  and  it  has  a  short 
half-life  (tyl  —  20-30  min).  Its  aberrant  expression  in  the 


Transcriptional 
activation  domain 


Cl 


mm 


ft/2  =  20-30  min 


tv2  >  14  h 


f,/2=  15-30  min 


DNA — [cACGTGI 


Transcription  activation 
Cell  proliferation 
Oncogenesis 


Transcriptional 

machinery  II 


Transcription  Inhibition 
and  regulation 


t 


DMA — |  CACGTGfc 


Transcription  repression 
Growth  arrest 
Differentiation 


Co-repressor? 


DNA— ICACGTCl — 


Figure  6.  Inhibition  of  paxillin/a4  binding  by  the  positional  scanning 
libraries  AxBC  (top,  A  scan),  AByC  (middle,  B  scan),  and  ABCz  (bot¬ 
tom,  Cscan).  Each  mixture  of  100  compounds  was  tested  in  triplicate 
at  5,  10,  and  20  gM  (total  compound). 


direct  deconvolution  did  not  identify  all  candidate  inhibitors 
which  were  identified  from  the  traditional  library.  This 
situation  is  a  natural  consequence  of  testing  larger  100- 
compound  mixtures  and  the  relative  insensitivity  of  the  assay 
to  the  contribution  of  any  single,  uniquely  acting  compound  in 
the  mixture.  Thus,  the  global  observations  were  effectively 
detected  with  the  positional-scanning  library,  and  a  useful 
lead  structure  with  defined  properties  was  identified.1791 
However,  more  subtle  discoveries  within  the  libraries  were 
not  identified.  Thus,  the  disadvantages  associated  with  the 
loss  of  their  detection  and  this  information  contained  within 
the  library  must  be  balanced  against  the  advantages  of  the 
ease  of  synthesis  of  the  parent  libraries  and  judged  in  light  of 
the  objectives  of  the  library  screening.  As  in  this  case,  the 
positional-scanning  libraries  typically  would  be  only  effective 
for  lead  identification  and  would  be  less  suitable  for  lead 
optimization. 


Figure  7.  Top:  Myc-Max-Mad  network  of  transcription  control:  cell  pro¬ 
liferation  versus  differentiation  Max  =  Myc  associated  factor  X.  Bot¬ 
tom:  Max(22-1 13)  dimer  complexed  with  DNA.  Reproduced  with  per¬ 
mission  from  Nature,  reference  [285], 
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absence  of  such  growth  factors  drives  cells  into  cell  cycle  but 
also  induces  apoptosis  unless  co-oncogenic  mechanisms  are 
present  (i.e. ,  inhibition  by  constitutive  bcl-2  overexpression). 

All  known  activities  of  Myc  require  heterodimerization 
with  Max,'285"2931  a  constitutively  expressed  bHLHLZ  protein 
that  is  stable  (tm  =  >  14  h)  and  always  present.  The  Myc/Max 
heterodimer  binds  the  DNA  sequence  CACGTG  (E  box 
element)  and  activates  transcription  through  the  transactiva¬ 
tion  domain  of  Myc.  DNA  binding  occurs  through  the  basic 
region  while  both  the  HLH  and  the  leucine  zipper  form  the 
dimerization  interface.1285,2861  Max,  but  not  Myc,  forms  homo¬ 
dimers  which  bind  the  same  DNA  site  and  inhibit  tran¬ 
scription.  In  part,  this  results  from  the  competition  for  Max 
and  for  the  common  DNA  site.  In  addition.  Max  hetero- 
dimerizes  with  another  family  of  bHLHLZ  proteins  that 
include  Mad  (Mad-1),  Mxi-1,  Mad-3,  and  Mad-4  that  are 
transiently  expressed  (tm  =  15-30  min)  and  signal  growth 
arrest  and  differentiation.'296, 2971  Their  dimerization  behavior 
is  similar  to  that  of  Myc  in  that  they  do  not  homodimerize  or 
interact  with  Myc  family  members,  but  readily  form  hetero¬ 
dimers  with  Max  that  bind  the  CACGTG  core  consensus 
sequence.  Mad  and  Myc  compete  for  Max  with  approximately 
equal  affinities  and  the  heterodimers  are  formed  in  prefer¬ 
ence  to  Max  homodimerization.  Mad/Max  heterodimers 
repress  or  antagonize  Myc/Max  transcriptional  activation.  In 
normal  cells,  the  expression  of  Mad  is  induced  by  differ¬ 
entiating  stimuli  while  Myc  expression  is  repressed.1274"2831 
Thus,  Myc/Max  dimers  induce  proliferation  and  apoptosis  in 
nontransformed  cells  while  Max/Max  and  Mad/Max  are 
involved  in  growth  arrest,  differentiation,  and  cell  survival. 
At  the  center  of  this  network  lies  Max  which  is  metabolically 
stable  (f,/2  =  >  14  h)  and  constitutively  expressed.  Its  hetero¬ 
dimer  partners,  Myc  ( tm  =  20-30  min)  and  Mad  (r,/2  =  15- 
30  min),  are  rapidly  degraded  and  highly  regulated.  Mitogenic 
stimulation  induces  a  rapid  rise  in  Myc  levels,  and  a  shift  in  the 
equilibrium  to  Myc/Max  dimers  and  transcriptional  activa¬ 
tion.  In  transformed  cells,  when  accompanied  by  oncogenic 
events  inhibiting  apoptosis,  constitutive  Myc  expression 
maintains  the  cell  proliferation  state  and  prevents  growth 
arrest  and  differentiation.1274"2831  Thus,  the  function  of  the 
Myc-Max-Mad  network  relies  on  four  major  features: 

1)  Max  is  a  required  dimerization  partner  for  Myc  (or  Mad) 
to  bind  DNA  and  execute  its  biological  functions. 

2)  Myc/Max  and  Mad/Max  heterodimers  form  in  preference 
to  Max/Max  homodimers. 

3)  The  highly  regulated  expression  levels  of  Myc  and  Mad 
induce  rapid  changes  in  the  equilibrium  between  the 
dimers.  The  Myc/Max  heterodimer  activates  transcription 
throughout  all  stages  of  the  cell  cycle  and  maintains  the 
cell  in  a  state  of  cell  growth  and  proliferation  while  the 
Mad/Max  heterodimer  signals  an  exit  from  the  cell  cycle, 
growth  arrest,  and  differentiation. 

4)  All  dimers  compete  for  common  target  DNA  sites,  and 
the  Myc/Max  transcription  activation  is  antagonized  by 
the  other  dimers.  Thus,  the  aberrant  expression  of  Myc 
that  leads  to  transformation  may  be  addressed  through 
inhibition  of  Myc/Max  heterodimerization  and  its  subse¬ 
quent  transcriptional  activation.  This  provides  an  attrac¬ 


tive  therapeutic  target  for  the  treatment  of  cancer  that  is 
central  to  the  disease. 

A  novel  FRET  assay  for  monitoring  Myc/Max  dimerization 
was  developed  specifically  for  this  work  (Figure  8).  Thus,  the 
fusion  proteins  (MycCFP  and  MaxYFP)  consisting  of  the 
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Figure  8.  Fluorescence  resonance  energy  transfer  (FRET)  assay.  Addi¬ 
tion  of  dimerization  inhibitors  induces  a  decrease  in  the  intensity  at 
527  nm  (dimer)  and  an  increase  in  intensity  at  475  nm  (monomer). 
Compounds  that  give  rise  to  a  large  ratio  lmlli2j  are  selected. 

CFP  =  cyan  fluorescent  protein,  YFP  =  yellow  fluorescent  protein. 


bHLHLZip  domain  of  human  c-myc  linked  to  the  N-terminus 
of  cyan  fluorescent  protein  (CFP)  and  the  bHLHLZip 
domain  of  Max  fused  to  the  N-terminus  of  yellow  fluorescent 
protein  (YFP)  were  cloned,  expressed,  and  purified  for  use  in 
the  assay.  Inhibition  of  dimerization  is  measured  as  a  loss  of 
the  FRET  signal  emission  at  527  nm  (YFP)  following 
excitation  of  CFP  at  433  nm.  Using  this  assay,  our  library  of 
40000  compounds  was  screened  for  inhibition  of  Myc/Max 
dimerization.  Two  closely  related  leads  emerged  from  an 
isoindoline  library  of  240  compounds  (Figure  9).  These 
remarkably  small  inhibitors  (38%  FRET  inhibition  at  25  |xm 
for  28)  were  subsequently  shown  to  inhibit  Myc/Max  binding 
in  a  conventional  ELISA  assay  and  to  inhibit  Myc/Max  DNA 
binding  in  a  gel-shift  assay  confirming  the  observations  made 
through  use  of  the  novel  FRET  assay.  Impressively,  the  two 
most  potent  leads  were  shown  to  inhibit  Myc-induced  trans¬ 
formation  of  chicken  embryo  fibroblasts  (IG)0  =  20  p.M) 
establishing  functional  activity  of  the  compounds  in  a  cellular 
assay.'80! 


4.4.  LEF-1  /P-catenin 

The  majority  of  colorectal  tumors  contain  mutations  in 
the  tumor-suppressor  protein,  adenomatous  polyposis  coli 
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Figure  9.  Inhibition  of  Myc-Max  heterodimerization  and  aberrant  gene 
transcription:  antitumor  target  validation. 


(APC)  of  the  colon,  that  result  in  the  release  and  accumu¬ 
lation  of  P-catenin  in  the  cell  nucleus.  p-Catenin  binds  to  and 
activates  transcription  factors  including  LEF-1  (Lymphoid- 
enhancers  binding  factor  i).I2iW-30|l  This  results  in  upregulated 
and  aberrant  gene  expression  which  is  a  key  step  in  the 
development  of  colon  cancer  (Figure  10).  The  LEF-1  tran¬ 
scription  factors  (TCF)  that  interact  with  P-catenin  share  an 
identical  DNA-binding  domain  referred  to  as  the  high 
mobility  group  (HMG)  domain  recognizing  the  sequence  5'- 
CTTTGWW-3'  (W  =  A  or  T).  Importantly,  LEF-1  binds  the 
DNA  minor  groove13021  through  its  HMG  domain  making  it  an 


Figure  10.  Inhibition  of  LEF-1 /p-catenin-mediated  gene  transcription: 
controlling  aberrant  gene  transcription  and  antitumor  target  validation. 
More  than  40000  compounds  were  screened  for  inhibition  of  LEF-1 
mediated  gene  transcription  in  a  TOPFLASH  luciferase  reported  assay. 
Two  leads  A1B2C8  and  A1B3C9  exhibit  TOPFLASH  selectivity,  they  are 
derived  from  a  150-member  non-amide  based  IDA  library  of  piperazi- 
nones. 


ideal  target  for  libraries  of  DNA  minor-groove  binding 
ligands.  When  P-catenin  interacts  with  the  amino  terminus 
of  LEF-1,  it  activates  transcription.  In  the  absence  of  P- 
catenin,  LEF-1  actively  represses  transcription.  It  is  thought 
that  LEF-1  bound  P-catenin  displaces  corepressors  and  p- 
catenin  binds  LEF-1  through  a  domain  distinct  from  that  of 
the  corepressors.  The  oncogenic  potential  of  P-catenin 
through  its  interaction  with  LEF-1  has  been  demonstrated 
by  Vogt  et  al.13031 

Initial  studies  were  conducted  screening  our  library  of 
40000  compounds  enlisting  a  TOPFLASH/FOPFLASH  cell- 
based  reporter  assay|29S1  transiently  transfected  into  a  colon 
cancer  cell  line  (SW480)  that  has  an  APC  mutation  which 
allows  P-catenin  to  accumulate  in  the  nucleus.  The  use  of  a 
luciferase  reporter  assay  permits  the  detection  of  leads  that 
act  by  disrupting  the  LEF-l/p-catenin  binding  interaction  or 
that  inhibit  LEF-l/DNA  binding.  This  approach  led  to  the 
identification  of  two  classes  of  small  molecules  that  inhibit 
LEF-l/p-catenin-mediated  gene  transcription.  The  first  class, 
of  which  A1B2C8  (3.7-fold  inhibition  at  10  pM)  and  A1B3C9 
(4.8-fold  inhibition  at  10  |xm)  are  representative,  proved  to  be 
a  surprising  class  of  small  molecules  that  probably  do  not  act 
by  inhibiting  LEF-l/DNA  binding,  but  rather  by  inhibiting 
LEF-l/p-catenin  binding  (Figure  10).  Their  evaluation  in  a 
functional  cellular  assay  revealed  that  they  inhibit  the  P- 
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catenin-induced  tumor  cell  transformation  of  the  colon  cell 
line  SW480.  The  initial  library  of  150  members  from  which  the 
lead  emerged  was  expanded  to  more  than  350  members  with 
the  identification  of  structural  features  important  for  activity 
in  the  reporter  and  functional  assays.171'  These  studies  are  now 
being  expanded  to  larger  libraries  based  on  A1B2C8  and 
A1B3C9  and  with  the  development  of  a  FRET  assay  capable 
of  directly  screening  for  disruption  of  LEF-l/(5-catenin  bind¬ 
ing.  This  assay  will  serve  to  define  the  site  of  action  of  the 
leads  related  to  A1B2C8  and  should  provide  new  candidate 
lead  structures  through  the  screening  of  future  libraries.  The 
second  class  of  selective  TOPFLASF1  inhibitors  emerged 
from  a  library  of  minor-groove  DNA  binding  compounds  and 
are  discussed  in  detail  in  Section  5.3. 


4.5.  EPO/EPOR:  Agonists  from  Antagonists 

Erythropoietin  (EPO),  a  34-kDa  monomeric  glycoprotein 
composed  of  165  amino  acid  residues  and  an  almost  equiv¬ 
alent  mass  of  carbohydrate,  is  the  principle  factor  regulating 
red  blood  cell  production.'304-311'  EPO  exerts  its  effects  by 
controlling  the  proliferation,  differentiation,  and  maturation 
of  erythroid  progenitors.  The  EPO  gene,  which  shows  no 
homology  with  other  known  genes,  is  a  single-copy  gene  and 
highly  conserved  across  different  species.  Recombinant 
human  EPO  has  found  clinical  applications  in  the  treatment 
of  anemias  particularly  those  resulting  from  renal  disease 
including  kidney  damage,  anemia  resulting  from  malignancy, 
AIDS  or  AIDS  treatment,  and  anemia  associated  with 
myelodysplastic  syndromes.'1011"311'  While  EPO  is  presently 
the  most  important  biotechnology  product  and  its  use  has 
evolved  into  a  >  $7  billion/year  market,  its  administration 
requires  intravenous  or  subcutaneous  injection.  This  situation 
has  led  to  an  interest  in  developing  small-molecule  mimics  of 
EPO  to  replace  the  recombinant  human  protein.  More 
recently,  EPO  itself  is  thought  to  be  a  potentially  useful 
antitumor  agent,'312'  although  the  mechanism  is  not  yet  well 
understood.  Prompted  by  observations  that  EPO-treated 
patients  experience  longer  survival  times  and  more  complete 
remissions  than  untreated  patients,  the  in  vivo  evaluation  of 
EPO  itself  for  antitumor  efficacy  was  successfully  con¬ 
ducted.'312'  EPO  elicits  its  response  through  binding  and 
homodimerization  of  its  receptor  EPOR1313-3161  which  is  a 
member  of  the  Class  I  cytokine  receptors  (Table  4).,183"IS5'  The 
Class  I  cytokine  receptors  are  characterized  by  the  presence 
of  one  or  two  conserved  200-amino  acid  fibronectin  type  III- 
like  domains,  four  conserved  cysteine  residues,  and  the 
conserved  motif  Trp-Ser-Xaa-Trp-Ser  (WSXWS)  in  the 
extracellular  domain.  The  intracellular  domains  of  the 
cytokine  receptors,  unlike  the  receptor  tyrosine  kinases,  lack 
intrinsic  enzymatic  activities.  Ligand  binding  induces  dimeri¬ 
zation  or  oligomerization  and  this  allows  the  interaction 
(dimerization)  and  activation  (cross  phosphorylation)  of 
noncovalently  associated  tyrosine  kinases  (Janus  Kinases: 
JAKs),  Figure  ll.l47,7U8S'311,l  In  turn,  the  activated  JAKs 
phosphorylate  cytoplasmic  domain  tyrosines  of  the  receptor 
enlisted  to  recruit  signaling  proteins  including  STATs  (Signal 
Transducers  and  Activators  of  Transcription).'187, 18S'  Recruit- 


Table  4:  Class  I  cytokine  receptors  and  approved  or  potential  therapeutic 
applications  of  cytokine  agonists  and  antagonists. 


Family 

Class  1  cytokine  receptors 

Examples  Activation  Characteristics 

CH 

GHR.  EPOR.TPOR,  PRLR,  C- 

homodimers 

receptor 

CSFR 

IL-3 

IL-3R,  GM-CSFR,  IL-5R 

heterodimerization  with  f$c 

receptor 

IL-6 

IL-6R,  LIFR,  CNTFR,  IL-11  R 

heterodimerization  with 

receptor 

gpl30 

IL-2 

IL-2Ra,  IL-2RP,  IL-4R,  IL-7R 

heterodimerization  with 

receptor 

IL-2Ry 

Therapeutic  applications  of  cytokine  agonists  and  antagonists 

Cytokine 

Agonist 

Antagonist 

EPO 

anemias,  selective  blood  dona- 

TPO 

tion,  cancer 
thrombocytopenia 

IL-2 

cancer 

histoincompatibility 

IL-3 

leukopenia,  myeloid  reconstitu- 

leukemia 

IL-4 

tion 

inflammation,  cancer 

allergy 

IL-6 

thrombocytopenia 

cancer,  osteoporosis, 

IL-11 

thrombocytopenia 

inflammation 

IL-12 

cancer,  infections 

histoincompatibility,  auto- 

C-CSF 

neutropenia,  myeloid  reconsti- 

immunity 

leukemia 

GM-CSF 

tution 

leukopenia,  myeloid  reconstitu- 

leukemia 

IFN  a/p 

tion 

cancer,  viral  infections,  autoim- 

inflammation 

IFN  y 

munity 

chronic  granulonatous  disease, 

inflammation,  autoim- 

infections 

munity 

ment  of  STATs  results  in  their  phosphorylation,  homo-  or 
heterodimerization,  and  subsequent  translocation  to  the 
nucleus  where  they  directly  activate  gene  transcription.  This 
is  the  most  direct  signal-transduction  pathway  known  and 
beautifully  enlists  coordinated  protein-protein  homo-  or 
heterodimerization  of  signaling  molecules  to  link  the  initia¬ 
tion  receptor-dimerization  event  directly  to  gene  transcrip¬ 
tion.  Notably,  receptor  homodimerization  enlists  JAK  and 
STAT  homodimers  (JAK2/JAK2  and  STAT5/STAT5  for 
EPOR)  that  bind  a  unique  DNA  sequence  and  upregulate 
the  target  genes,  whereas  receptor  heterodimerization  enlists 
JAK  and  STAT  heterodimers  (i.e.,  STAT4/STAT5)  that  target 
a  different  DNA  sequence  for  upregulating  a  different  set  of 
targets  genes.  Thus,  this  combinatorial  dimerization  strategy 
is  enlisted  in  the  natural  signaling  cascade  to  maximize  the 
diversity  of  signals  with  a  limited  number  of  proteins. 

The  approach  we  adopted  is  generalizable  to  all  cytokine 
receptors.  Because  of  the  scale  on  which  the  synthesis  is 
conducted,  the  libraries  could  be  enlisted  to  screen  for 
agonists  or  antagonists  against  multiple  receptors.  Although 
not  an  objective  of  present  work,  their  availability  should 
allow  such  studies  in  the  future.  As  detailed  earlier,  homo-  or 
heterodimerization  linkage  of  ligand  binding  antagonists  can 
be  enlisted  to  develop  ligand  agonists.'170'  Potential  applica¬ 
tions  of  such  studies  with  the  cytokine  receptors  are 
summarized  in  Table  4. 
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Figure  11.  Top:  EPO  agonists  (mimics)  that  function  by  promoting 
EPOR  dimerization.  Bottom:  Development  of  EPO  agonist  30  which  is 
active  in  cellular  functional  assay,  step  1 :  over  100000  molecules  were 
screened  for  EPOR  binding,  the  B  subunit  is  shown  to  be  the  most 
critical  for  binding  activity.  Step  2 :  A  library  of  1 1 5  A  analogues  pro¬ 
vided  an  improved  lead  structure.  The  B  subunit  features  contributing 
to  activity  were  identified  (77  analogues).  A  library  of  70  C  analogues 
provided  an  improved  lead  structure,  three  1000-member  libraries  with 
alternative  templates  were  examined.  Higher  order  IDAs  examined: 
binders  examined  in  functional  assay. 


The  disclosure  of  a  20-amino  acid  cyclic  peptide 
RWJ  61233  (EMP1)  identified  in  a  phage  library  that  binds 
to  and  activates  EPOR  (KD  =  200  nM  versus  KD  =  200  pM  for 
EPO)1’19,3201  along  with  the  X-ray  structure  determination  of 
the  peptide-receptor  complex13211  revealed  in  detail  how  two 
molecules  of  the  peptide  dimerize  EBP  (EPO  binding 
protein,  the  extracellular  domain  of  EPOR;  Figure  12  and 
13).'3221  The  structure  of  the  complex  revealed  a  peptide  dimer 
flanked  by  two  molecules  of  the  EBP  and  also  reveals  a 
striking  structural  homology  to  the  growth  hormone  and 
prolactin  receptors.1323'3241  Interestingly,  in  the  stem  regions 
the  two  molecules  of  EBP  have  very  little  contact  (ca.  75  A2) 
and,  contrary  to  expectations,  the  WSXWS  motif  is  not 
involved  in  ligand  binding  nor  located  in  or  near  the  ligand 
binding  site  or  dimerization  interface.  Consistent  with  muta¬ 
tional  studies  of  growth  hormone  and  its  receptor1325-328'  which 
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Figure  12.  Structure  of  EPO  mimetic  peptide  (EMP1),  summary  of  rela¬ 
tive  EPOR  binding,  and  summary  of  key  contacts  derived  from  the 
X-ray  crystal  structure  of  the  complex  with  EBP  (EPOR). 


revealed  that  the  majority  of  the  intermolecular  binding 
stabilization  is  derived  from  a  few  residues  in  the  contact 
interface,  the  dimeric  peptide  binds  EBP  drawing  upon  a 
limited  number  of  contacts  for  binding  and  dimerization. 
Further  support  for  the  key  contacts  has  been  derived  through 
modifications  of  the  cyclic  peptide  (Figure  12).'319'320'  This 
result  suggested  that  relatively  small  ligands  may  be  devised 
as  EPO  agonists  even  though  the  cyclic  peptide  dimer  itself  is 
quite  large.  The  cyclic  peptide  contains  a  single  disulfide  bond 
Cys6/Cysl5  which  links  two  short  (1-strands  (residues  4-7  and 
13-16)  that  are  connected  by  a  type  I  (3-turn  (Gly9-Prol0- 
Leull-Thrl2).1329'  The  dimerized  cyclic  peptide  forms  a 
hydrophobic  clamp  that  slides  into  a  receptor  dimerization 
interface  making  contacts  with  both  receptor  molecules. 
Central  to  this  binding  is  a  hydrophobic  interface.  Pro  149- 
Hisl53,  which  houses  residues  engaged  in  main-chain  and 
side-chain  hydrogen  bonds  to  Gly9-Thrl2/Trpl3  of  the 
peptide.  Critical  and  also  central  to  the  binding  dimerization 
is  the  receptor  Phe  93  which  provides  key  contacts  with  Trp  13 
and  Tyr4  complementing  the  adjacent  Ser92/Tyr4-OH  hydro¬ 
gen  bond.  Similarly,  Phe  8  spans  the  binding  sites  defined  by 
Leu  33,  Met  150,  and  Phe  205  which  contributes  significantly 
to  the  stabilization  of  the  dimer  complex.  Our  candidate 
library  agonists  were  selected  on  the  basis  of  maintaining  and 
spanning  these  critical  interactions.  Although  space  precludes 
a  detailed  description  of  the  modeling  efforts  that  went  into 
the  design  of  our  initial  libraries,  it  is  perhaps  sufficient  to 
indicate  that  effective  EPOR  binding  ligands  were  obtained 
in  the  first  libraries  and  their  further  exploration  has  provided 
even  better  binders. 
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Figure  13.  Top:  X-ray  structure  of  the  EBP-EMP1  complex. 
Bottom:  Key  residues  involved  in  the  peptide-EBP  interaction. 


Employing  libraries  prepared  from  iV-Boc-iminodi acetic 
acid  anhydride  and  a  three-  or  four-step  solution-phase 
reaction  with  acid-base  extractions  for  purification,  numer¬ 
ous  libraries  were  prepared  (roughly  100000  compounds)  and 
evaluated  for  EPOR  binding.  A  number  of  EPOR  binding 
agents  were  identified  that  inhibited  I12S-EPO  binding  to 
EPOR  (IC5n  =  1-80  pM).  These  antagonists  of  EPO  binding 
were  covalently  linked  to  provide  C2  symmetric  dimers 
capable  of  receptor  dimerization  and  activation.  Several  of 
the  most  active  compounds  were  found  to  be  weak  agonists 
and  induced  concentration-dependent  proliferation  of  an 
EPO-dependent  cell  line  (UT-7/EPO)  exhibiting  the  charac¬ 
teristic  bell-shaped  dose  response  curve  while  having  no 
effect  on  a  similar  cell  line  (FDC-P1)  lacking  the  EPOR 
(Figure  11  and  14).[8Z1 

To  date,  only  three  reports  of  EPO  agonists  have  been 
disclosed182,319,3301  and  those  reported  above1821  are  only  about 
10-fold  less  potent  than  EMP1[3191  (IC5n  =  5-10  versus  0.5  (im). 
They  are  substantially  more  efficacious  than  a  recently 
reported  Merck  agonist13301  (15-20%  versus  4%  of  activity 
of  EPO),  considerably  smaller  than  EMP1  (M,ss  2100,  but 
functions  as  dimer  of  M,  ~  4200)  or  the  Merck  agonist  (Mr~ 
6400),  and  are  substantially  more  accessible  (four  synthetic 
steps).  Key  binding  elements  have  been  identified  allowing 
for  further  potential  reductions  in  the  size  of  the  identified 
leads. 

Since  the  agents  act  on  the  extracellular  domain  of  the 
receptor,  they  need  not  be  permeable  to  the  cell.  As  such,  the 
relative  size  of  the  agonist  structures  need  not  be  as  significant 


a  concern  as  it  is  with  an  intracellular  target.  Similarly,  one 
can  anticipate  that  the  amide  bonds  present  in  the  existing 
leads  can  be  engineered  out  of  the  structures  (Figure  14). 
Finally,  one  can  anticipate  that  the  size  of  the  agonist 
structures  may  be  reduced  and  their  affinity  increased 
through  examination  of  the  X-ray  structures  of  complexes 
with  EBP.  These  studies  should  allow  the  design  of  additional 
key  interactions  into  the  leads,  the  trimming  away  of  non¬ 
productive  elements  of  the  compounds,  and  permit  the  design 
of  rigid  structures  embodying  the  EBP  bound  conformation. 
Finally,  it  is  now  clear  that  simple  dimerization  of  the  EPO 
receptor  is  insufficient  for  receptor  activation.1170,316,331'3341 
This  dimerization,  or  conformational  reorganization  of  a 
cell  surface  receptor  dimer,  must  occur  in  a  manner  that 


Figure  14.  Concentration-dependent  proliferation  activity  in  an  EPO- 
dependent  cell  line  exhibiting  a  bell-shaped  dose-response  curve. 
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brings  the  receptor  cytoplasmic  domains  in  close 
proximity  to  one  another  permitting  the  JAK  cross 
phosphorylation  and  initiation  of  the  signaling 
cascade.  Thus,  rigid  variants  of  the  lead  structures 
that  not  only  bind  the  ligand  site  and  stabilize 
receptor  dimerization,  but  that  also  more  effec¬ 
tively  induce  the  active  conformation  of  the 
receptor  dimer  may  help  convert  the  partial 
agonists  into  full  agonists. 


5.  Targeting  Protein-DNA  Interactions 

Studies  on  the  high-throughput  synthesis  and 
screening  of  DNA  binding  compounds172,84-881  have 
also  been  detailed  that  are  related  to  interests  in 
understanding  and  exploiting  their  properties. 189-1,61 
Our  approach  integrates  the  solution-phase  tech¬ 
niques  for  the  synthesis  of  libraries  with  a  techni¬ 
que  we  introduced  for  rapid  high-throughput 
screening  for  DNA  binding  affinity  or  sequence 
selectivity)87'97,,81  These  techniques  can  be  com¬ 
bined  to  rapidly  explore  and  define  the  structural 
features  responsible  for  the  sequence-selective 
DNA  binding  properties  of  known  agents,  to 
discover  new  patterns  for  small-molecule  recogni¬ 
tion  of  DNA  (new  base-pair  (bp)  codes),  and  to 
screen  for  compounds  that  selectively  target  con¬ 
sensus  sequences  of  transcription  factors  for  con¬ 
trolling  aberrant  gene  transcription  (i.e. ,  LEF-l/p- 
catenin,  androgen  receptor). 

The  regulation  of  gene  expression  is  based 
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Scheme  8.  Solution-phase  synthesis  of  distamycin  A.  Optimized  yields  in  parenthe¬ 
ses.  All  the  compounds  were  obtained  in  >95%  purity  after  acid-base  extraction, 
Chromatographic  purification  was  not  used.  DMAP  =  4-N,N-dimethylaminopyridine, 
Boc-Pyr-OH  =  31,  EDCI  =  l-ethyl-3-(3-dimethylaminopropyl)carbodiimide, 

CDI  =  1  ,l'-carbonyldiimidazole. 


on  the 

sequence-selective  recognition  of  nucleic  acids  by  repressor, 
activator,  and  enhancer  proteins.  Selective  control  of  such 
processes  has  been  a  long-standing  goal  and  small  molecules 
that  selectively  bind  DNA  and  activate  (block  a  repressor)  or 
inhibit  (block  an  activator)  gene  expression  hold  great 
promise  as  therapeutics)335-356'  Complicating  such  studies  is 
the  recognition  that  it  is  often  not  a  single  sequence  that  is 
ideally  targeted,  but  rather  an  ensemble  of  related  sites  that 
compose  the  consensus  binding  sequence  of  a  nuclear 
receptor  or  transcription  factor.  Consequently,  the  discovery 
of  such  agents  has  been  slow  owing  to  the  complexity 
associated  with  understanding  small-molecule/DNA  interac¬ 
tions,  the  iterative  process  of  designing  and  synthesizing 
individual  compounds  targeted  towards  specific  DNA 
sequences,  and  the  technically  demanding  techniques 
involved  in  the  determination  of  their  binding  affinity  for 
any  given  sequence,  much  less  an  ensemble  of  related 
sequences. 


5.t.  High-Throughput  Synthesis  of  DNA-Binding  Compounds 
Using  Solution-Phase  Library  Techniques 
5.1.7.  Distamycin  and  a  Library  of 2640  Analogues 

A  rapid,  parallel  synthesis  of  distamycin  A  analogues  was 
developed  enlisting  the  simple,  acid-base  liquid-liquid 
extraction  for  purification  and  isolation  of  each  intermediate 


and  final  product  (Scheme  8).  The  utility  of  this  method  was 
first  demonstrated  with  the  preparation  of  distamycin  A 
(eight  steps,  40%  overall  yield),  a  naturally  occurring 
antitumor  antibiotic,  where  each  of  the  reaction  products 
was  isolated  in  at  least  95  %  purity)871  A  prototypical  library  of 
2640  analogues  was  assembled  in  a  small-mixture  format  of 
two  libraries  of  132  mixtures  of  10  compounds  providing  each 
compound  in  multimilligram  quantities  sufficient  for  repeti¬ 
tive  screening  in  multiple  assays  (Scheme  9).  Screening  the 
initial  library  in  a  functional  assay  for  cytotoxic  activity 
(L1210)  revealed  two  uniquely  active  compounds  which  were 
found  to  be  1000-times  more  potent  than  distamycin  A)871  A 
complementary  rapid,  high-throughput  screen  for  DNA 
binding  affinity  was  also  developed  which  is  applicable  for 
assessing  binding  to  DNA  homopolymers  or  specific  sequen¬ 
ces  (discussed  in  Section  5.2).  Using  this  technique,  alterna¬ 
tive  AT-rich  binding  agents  were  identified  establishing  the 
usefulness  of  the  approach  and  providing  two  new  and 
effective  DNA-binding  distamycin  A  analogues.  In  addition, 
a  comparison  of  several  distamycin  analogues  established 
substituent  contributions  to  AT-rich  binding  that  may  be 
safely  implemented  in  future  libraries.  Extension  of  these 
studies  to  identify  effective  binders  to  predefined  sequences 
was  conducted  in  the  context  of  the  androgen  response 
elements  PSA-ARE-3  and  the  ARE-consensus  sequences, 
the  latter  of  which  contains  an  AT-rich  sequence  interrupted 
by  a  GC  base  pair,  and  for  which  effective  binders  might 
prove  useful  in  the  treatment  of  resistant  prostate  cancer. 


©  2003  Wiley-VCH  Verlag  GmbH  &  Co.  KGaA,  Weinheim 


www.angewandte.org 


Angew.  Che/v.  Int.  Ed.  2003,  42,  4138-4176 


Combinatorial  Chemistry 


Angewandte 

Ckemie 


H 

V' 

o 


9  « 

N 

Me 


//  \ 

N 

Me 


Solution  phase  combinatorial 
chemistry  using  10-12  different 
heterocyclic  amino  acids  and 
liquid-liquid  purification  protocols 


// 

V 

Me 


10  substitutions 
-11  substitutions 
-12  substitutions 


N^yNH2 
6  NH 


First  generation  libraries 


1320  compounds  (10  x  1 1  x  12) 
(132  mixtures  of  10  compounds) 


10  compounds  with 
<  10  PM, 

six  contain  CDPl 
in  key  position  A. 

ICM  of  the  three  most 
potent  compounds 
BOC-B5-A7-G8I  2pM 
Boc-B4"A7*CBt  3  pM 
Boc-'B3"A7~CBI  5pM 
CC-1055:  20  pM 


Functionalize  amine  terminus 
with  a  basic  side  chain 


Me 


Me 


36 


Second  generation  libraries:  1320  compounds 

increased  affinity  (132  mixtures  of  10  compounds) 


Scheme  g.  Solution-phase  synthesis  of  a  library  of  2640  potential 
DNA-binding  distamycin  analogues. 


5.7.2.  CC-1065  and  the  Duocarmycins:  A  132-Membered  Library 
Defining  the  Role  of  the  DNA-Binding  Domain 

A  library  of  132  CBI  (l,2,9,9a-tetrahydrocyclopropa[c]- 
benzo[e]indoI-4-one)  analogues  of  CC-1065  and  the  duocar¬ 
mycins,  potent  antitumor  antibiotics  that  alkylate  duplex 
DNA,  was  prepared  utilizing  the  solution-phase  technology  of 
acid-base  liquid-liquid  extraction  for  the  isolation  and 
purification  steps  (Scheme  10)J881  The  132  analogues  consti- 


11 


132  compounds  (12  x  11) 

Scheme  10.  Solution-phase  synthesis  of  132  CC-1065  analogues  based 
on  CBI.  All  the  compounds  were  purified  by  liquid-liquid  acid-base 
extraction. 


tuted  a  systematic  study  of  the  DNA  binding  domain  with  the 
incorporation  of  dimers  composed  of  monocyclic,  bicyclic, 
and  tricyclic  (hetero)aromatic  subunits.  From  their  examina¬ 
tion,  clear  trends  in  cytotoxic  potency  and  DNA  alkylation 
efficiency  emerged  highlighting  the  importance  of  the  first- 
attached  DNA-binding  subunit  (A  subunit):  tricyclic  is  more 
active  than  bicyclic,  which  in  turn  is  far  more  active  than 
monocyclic  (hetero)aromatic  subunits  (Figure  15).  Notably, 
the  trends  observed  in  the  cytotoxic  potencies  paralleled 
those  observed  in  the  relative  efficiencies  of  DNA  alkylation. 
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Figure  7j.  LI  21 0  Cytotoxic  activity  of  the  CC-1065  analogues. 


Our  interpretation  of  these  results  is  that  the  trends  represent 
the  partitioning  of  the  role  of  the  DNA-binding  subunit(s) 
into  two  distinct  contributions.  The  first  of  which  is  derived 
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from  an  increase  in  DNA-binding  selectivity  and  affinity 
which  leads  to  property  enhancements  of  10-100-fold  and  is 
embodied  in  the  monocyclic  series.  The  second,  which  is 
additionally  embodied  in  the  bicyclic  and  tricyclic  heteroar¬ 
omatic  subunits,  is  a  contribution  to  the  catalysis  of  the  DNA 
alkylation  reaction  that  provides  additional  enhancements  of 
100-1000-fold.  The  total  enhancement  thus  exceeds  25000- 
fold.  Aside  from  the  significance  of  these  observations  in  the 
design  of  future  CC-1065/duocarmycin  analogues,  their 
importance  to  the  design  of  hybrid  structures  containing  the 
CC-1065/duocarmycin  alkylation  subunit  should  not  be 
underestimated.  Those  that  lack  an  attached  bicyclic  or 
tricyclic  A  subunit,  that  is,  duocarmycin/distamycin  hybrids, 
can  be  expected  to  be  intrinsically  poor  or  slow  DNA 
alkylating  agents. 

5.7.3.  A  Prototype  Triostin  A  Analogue  Solution-Phase  Synthesis 

A  solution-phase  synthesis  of  azatriostin  A,  a  key  ana¬ 
logue  of  the  DNA  bisintercalator  triostin  A  (Scheme  11),  was 
developed  such  that  libraries  of  triostin  A  analogues  could  be 


O  Me  : 


Scheme  11.  Triostin  A:  naturally  occurring  DNA  bisintercalator. 

prepared.1721  This  work,  in  conjunction  with  the  distamycin  A 
and  CC-1065  analogue  libraries,  highlights  the  ease  with 
which  natural  product  libraries  targeting  DNA  can  be 
constructed  using  solution-phase  technology  to  rapidly  iden¬ 
tify  the  key  structural  components  of  the  natural  product  as 
well  as  to  discover  and  develop  new  therapeutics. 

5. 2.  The  FID  Assay:  A  High-Resolution,  High-Throughput  Assay 

for  Establishing  DNA-Binding  Affinity  and  Selectivity 

A  variety  of  techniques  are  commonly  used  to  establish 
the  DNA-binding  properties  of  small  molecules.13571  Most  are 
technically  challenging  and  require  the  knowledge  of  speci¬ 
alized  biochemical  procedures  and  assay  reproducibility 
comes  only  with  experience.1 1358-359!  Prior  to  our  efforts,  none 
were  applicable  to  the  high-throughput  screening  required  for 
assaying  a  library  of  compounds  against  an  ensemble  or 
complete  library  of  sequences.  The  most  widely  utilized 
methods  are  footprinting  or  affinity  cleavage  and  both  have 
been  used  qualitatively  and  quantitatively  to  establish  DNA- 
binding  selectivity  or  affinity.  Intrinsic  in  the  methods 
employed  in  the  destructive  assays  is  the  binding  character¬ 
ization  at  the  highest  affinity  sites  within  an  incomplete  set  of 
sites  contained  in  a  100-200-bp  segment  of  DNA,  and  modest- 
or  low-affinity  sites  are  not  easily  probed.  As  a  consequence, 


the  full  DNA-binding  profile  of  candidate  ligands  is  not  easily 
assessed  or  compared  using  these  techniques. 

The  nondestructive  fluorescent  intercalator  displacement 
(FID)  assay,  which  addresses  these  issues  of  ease  of  use  and 
information  content,  utilizes  the  displacement  of  ethidium 
bromide  (or  thiazole  orange)  from  hairpin  deoxyoligonucleo- 
tides  (Figure  16).|72RT|  A  hairpin  deoxyoligonucleotide  con- 


addition  of  a  fluorescent 
DNA  intercalator  (e.g, 
ethidium  bromide  or 
thiazole  orange) 


10-20  fold  increase 
in  fluorescence 


decrease  in  relative  fluorescence 
indicates  DNA  affinity 


Figure  16.  Fluorescent  intercalator  displacement  (FID)  assay.  The 
method  is  suitable  for  screening  a  library  to  identify  a  single  com¬ 
pound  with  affinity  for  a  given  DNA  sequence,  or,  for  screening  DNA 
libraries  to  establish  the  sequence  preference  of  a  DNA  binding  agent. 

taining  the  sequence  of  interest,  or  a  library  of  hairpin 
deoxyoligonucleotides  containing  all  four  bp  (136  hairpins)  or 
five  bp  (512  hairpins)  sequences  individually  displayed  in  96- 
well  plates  are  treated  with  the  intercalator,  yielding  a 
fluorescence  increase  upon  DNA  binding.13601  Addition  of  a 
DNA-binding  compound  results  in  a  decrease  in  fluorescence 
owing  to  displacement  of  the  bound  intercalator.  The 
decrease  in  fluorescence  is  directly  related  to  the  extent  of 
DNA  binding,  which  provides  relative  DNA  binding  affinities 
and  a  rank-order  binding.  This  rank-order  binding  profile  can 
be  used  in  the  assay  of  a  library  of  compounds  against  a  single 
sequence  to  identify  the  best  candidate  ligands  for  a  targeted 
sequence.  Alternatively,  it  can  entail  the  assay  of  a  single 
compound  against  a  library  of  hairpin  DNAs  containing  all 
possible  sequences  thereby  establishing  a  high-resolution 
profile  of  its  DNA-binding  selectivity. 
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5.2.1.  Assaying  Libraries  of  Compounds  Against  a  Target 
Sequence  Exemplified  with  the  Discovery  of  Inhibitors  of 
Androgen  Receptor-Mediated  Cene  Transcription 

The  FID  assay  can  be  used  to  rapidly  screen  a  library  of 
compounds  against  any  target  sequence  or  ensemble  of 
sequences  in  a  96-well  format.  This  approach  was  utilized  to 
identify  potent  and  selective  inhibitors  of  androgen  receptor- 
mediated  gene  transcription  (Figure  17).  The  emergence  of 
hormone-independent,  constituently  active  androgen  recep¬ 
tor  dimer,  which  is  unresponsive  to  competitive  antagonist 
treatment,  is  responsible  for  prostate-cancer  relapse  that  is 
resistant  to  chemotherapeutic  treatment.  A  potentially  effec¬ 
tive  treatment  for  such  resistant  prostate  cancer  could  entail 
administration  of  a  DNA-binding  agent  selective  for  the 
androgen  response  element  (ARE)-consensus  sequences. 
Such  a  treatment  would  competitively  inhibit  the  constitu- 
tively  active  androgen  receptor  DNA  binding  and  its  tran¬ 
scription  activation. 

We  used  the  FID  assay  with  two  hairpin  deoxyoligonu- 
cleotides  containing  two  related  sequences  of  the  androgen 
response  element  (ARE),  the  14-bp  ARE-consensus  and 
PSA-ARE-3  sequences.  The  latter  sequence  contains  a  five- 
bp  AT-rich  sequence  that  could  be  anticipated  to  bind 
distamycin-like  structures  while  the  former  contains  a  similar 
sequence  interrupted  by  a  GC  bp.  In  the  screening  of  our 
library,  several  candidates  were  identified  which  bound  the 
AT-rich  site  of  the  PSA-ARE-3  sequence.  One  analogue  (49, 
Figure  17)  maintained  the  high-affinity  binding  to  the  ARE- 
consensus  sequence.  Compound  49  appeared  suitable  as  an 
inhibition  of  the  gene  transcription  initiated  by  hormone 
insensitive  androgen-receptor  dimerization  and  DNA  binding 
characteristic  of  therapeutic-resistant  prostate  cancer.1*71 
Notably,  49  exhibited  potent  (IC5n  =  8  nM)  and  selective  (ca, 
40-fold)  inhibition  of  androgen  receptor-mediated  gene  tran¬ 
scription  in  a  cell-based  reporter  assay,  albeit  requiring 
liposome  delivery  of  the  compound  for  cell  penetration  and 
observation  of  the  activity. 

5.2.2.  High-Resolution  Definition  of  Sequence  Selectivity 

This  FID  assay  is  especially  powerful  for  establishing  the 
complete  sequence  selectivity  of  a  DNA-binding  agent.  The 
technique  entails  the  measurement  of  the  loss  of  fluorescence 
derived  from  the  displacement  of  ethidium  bromide  from 
hairpin  deoxyoligonucleotides  containing  all  possible  five 
(512  hairpins)  or  four-bp  sequences  (136  hairpins)  displayed 
in  a  96-well  format  (Figure  18).  In  selected  instances,  the 
change  in  fluorescence  of  the  DNA-binding  compound  itself 
can  be  monitored.  This  includes  characterization  of  modest 
and  nonbinding  sequences  not  accessible  by  other  techniques 
(i.e.,  footprinting),  and  such  binding  profiles  may  prove 
instrumental  in  distinguishing  candidate  ligands  that  display 
selectivity  for  an  ensemble  of  related  sequences  characteristic 
of  the  consensus  sequences  of  transcription  factors.  Technical 
issues  associated  with  the  use  of  the  assay  were  investigated, 
multiple  compounds  of  varying  affinities  were  examined,  and 
its  use  in  comparing  the  DNA  binding  properties  of  distamy- 
cin  A,  netropsin,  DAPI,  Hoechst  33258,  and  berenil  were 
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Figure  17.  FID  assay  used  in  the  screening  for  inhibitors  of  androgen 
receptor  (AR)  mediated  gene  transcription  by  evaluating  the  affinity  to 
PSA-ARE3  and  ARE-consensus  sequences. 


disclosed.1971  The  cost  of  the  512  hairpins  for  a  single  assay  is 
about  $100  (ethidium  bromide)  or  less  (for  thiazole 
orange).1981  The  technique  is  nondestructive  indicating  that 
immobilization  of  the  hairpins  (chips,  beads,  glass  slides) 
could  permit  their  rinsing  and  reuse  in  subsequent  assays.  This 
hairpin  immobilization  and  reuse  would  remove  the  barrier  to 
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512  D\'A  Sequences 


Decreasing  Affinity  — *► 


Figure  18.  Determination  of  distamycin  A  sequence  selectivity  in  one 
experiment  by  using  the  FID  assay.  The  relative  affinity  for  a  given 
DNA  sequence  is  proportional  to  the  decrease  in  fluorescence.  The 
affinity  decreases  with  increasing  AT  content.  The  absolute  binding 
constants  obtained  are  comparable  to  those  obtained  with  other  tech¬ 
niques. 


examining  libraries  of  longer  sequences  (>5  bp)  and  extend 
the  use  of  the  technology  to  binding  site  sizes  typical  of 
proteins. 

5.2.3.  Scatchard  Analysis  of  Titration  Curves:  Absolute  Binding 
Constants  and  Stoichiometry  of  Binding 

The  FID  assay  may  be  used  to  establish  binding  constants 
for  any  given  sequence  through  quantitative  titration.  Several 
methods  of  establishing  binding  constants  were  examined  and 
the  most  reliable  is  a  Scatchard  analysis  of  the  titration 
binding  curve  which  also  determines  the  stoichiometry  of 
binding  (Figure  19)J971  Direct  measurement  of  binding  con¬ 
stants  of  fluorescent  DNA-binding  compounds  (DAPI, 
Hoechst  33258)  compared  favorably  to  those  derived  from 
the  indirect  displacement  of  ethidium  bromide  and  thiazole 
orange  (Table  5).  As  a  demonstration  of  the  power  of  this 
technique,  the  absolute  binding  constants  for  netropsin  with 
each  four-bp  AT  sequence  were  determined  establishing  its 
full  relative  and  absolute  selectivity  (Table  6).[9S1 

5.2.4.  Rapid  Characterization  of  DNA  Binding  Modes 

The  FID  assay  has  been  utilized  to  distinguish  between 
and  establish  binding  modes  and  binding-site  sizes.  This  work 
has  permitted  the  first  characterization  of  a  novel  substituted 
(I-alanine  hairpin  polyamide  and  the  cooperative  2:1  side-by- 
side  parallel  (versus  antiparallel)  binding  of  iminodiacetic 


Figure  19.  Top:  Titration  of  netropsin  versus  the  hairpin  containing 
5'-AATTT-3'  at  1.1  pM  (S.8  gM/bp),  the  intersection  is  at  1.05  equiva¬ 
lents  which  indicates  a  1:1  binding  and  bottom:  Scatchard  plot 
(Slope  =  -K). 


Table  j:  Binding  constants  of  DAPI  and  Floechst  33258. 


A.  Ethidium  bromide  and  thiazole  orange  binding  constants 

Hairpin 

Keb  [x  106  m' 

-T 

^TO 

(x106m-’]w 

5'-AAAAA-3' 

0.27  (4.5)|b| 

1.2  (3.5)'b> 

5'-AATTT-3' 

0.42  (3.6) 

1.3  (3.3) 

5'-ATTAA-3' 

0.87  (3.3) 

1.0  (3.3) 

5'-AATAA-3' 

0.73  (3.3) 

1.1  (3.3) 

DAPI  binding  constants 

DNA  Sequence 

iqxio'M-'f1 

* 

X 

O 

K'[x106m~1]H 

5'-AATTT-3' 

58 

(0.91) 

110 

(0.94) 

120  (1.1) 

5'-AATAA-3' 

26 

(0.83) 

59 

(0.98) 

87  (0.94) 

5'-ATTAA-3' 

24 

(0.83) 

52 

(0.88) 

77  (1.0) 

5'-AAAAA-3' 

n.d. 

'A  n.d/l 

50 

(0.95) 

65  (0.99) 

Hoechst  33258  binding  constants 

DNA  Sequence 

k[xl06M-'f 

K[xl0( 

/qxlO'M-’]1'1 

5'-AATTT-3' 

75 

(0.90) 

88 

(0.96) 

177  (0.98) 

5'-AAAAA-3' 

36 

(0.86) 

72 

(0.98) 

145  (0.92) 

5'-AATAA-3' 

28 

(0.77) 

34 

(1.06) 

83  (1.01) 

5'-ATTAA-3' 

13 

(0.89) 

19 

(0.91) 

47  (1 .05) 

[a]  Established  for  the  hairpin  deoxyoligonudeotides  containing  the 
indicated  sequence  by  direct  titration,  [b]  Stoichiometry  of  binding 
established  experimentally,  [c]  Thiazole  orange  displacement  (experi¬ 
mental  stoichiometry  of  binding),  [d]  Ethidium  bromide  displacement 
(experimental  stoichiometry  of  binding),  [e]  Direct  fluorescence  titration 
(experimental  stoichiometry  of  binding),  [f]  n.d.  =  not  determined. 


acid  linked  pyrrole  polyamides.  Polyamides  composed  of  N- 
methylpyrrole,  N-methylimidazole,  and  a  growing  set  of 
structural  analogues  bind  in  the  DNA  minor  groove  with 
predictable  sequence  selectivity  and  high  affinity.^350'3*1"363' 
The  head-to-tail  linkage  of  such  polyamides  with  the  five- 
atom  linker  y-aminobutyric  acid  (y)  has  been  shown  to 
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Combinatorial  Chemistry 


Table  6:  Comparative  binding  constants. 


Titrant 

DNA 

Sequence 

fC[xl06M_1] 

Binding  stoichiom¬ 
etry 

ethidium  bro¬ 
mide 

5'-AATT 

2.7 

2.92 

tbiazole  orange 

5'-AATT 

15 

3.05 

Titrant 

DNA 

Sequence 

Ethidium  bro¬ 
mide 

K[x106m-'] 

Thiazole  orange 

K’[x106m"1] 

Netropsin 

5'-AAAT 

127 

113 

5'-AAAA 

92 

71 

5'-AATT 

65 

54 

5'-AATA 

64 

44 

5'-ATTA 

45 

35 

5'-ATAT 

41 

33 

5'-ATAA 

34 

18 

5'-TAAA 

26 

15 

5'-TTAA 

11 

8 

5'-TATA 

11 

8 

provide  hairpin  polyamides  that  mimic  the  2:1  side-by-side 
antiparallel  binding  of  the  unlinked  polyamides  (Figure  20), 
enhance  the  binding  affinity  100  to  10000-fold,  and  improve 
the  binding  selectivity.1364'  In  contrast,  polyamides  incorporat¬ 
ing  a  one-carbon-atom  shorter  head-to-tail  linker,  (3-alanine 
(|3),  bind  preferentially  in  an  extended  conformation  forming 
1:1  or  side-by-side  antiparallel  2:1  complexes.1364  3651  By  using 
the  FID  assay  and  a  series  of  DNA  hairpins  containing  a  AT- 
rich  binding  site,  the  size  of  which  was  systematically  varied, 
the  stoichiometry  of  binding  (1:1  vs  2:1)  and  the  binding  mode 
(hairpin  vs  extended)  for  such  distamycin-derived  polyamides 
can  be  rapidly  and  easily  established  (Figure  20).183’841 

Using  this  approach,  appropriate  substitution  of  a  (3- 
alanine  linker  was  established  to  induce  hairpin  instead  of 
extended  binding  providing  an  alternative  hairpin  linker  to  y 
(Figure  21).183'841  These  polyamide  analogues  were  obtained 
using  a  parallel  solution-phase  approach  involving  a  series  of 
EDCI-mediated  coupling  reactions  (Scheme  12). 

As  a  natural  extension  of  these  studies,  the  behavior  of 
head-to-head  iminodiacetic  acid  (IDA)  linked  polyamides 
was  characterized.  Not  only  was  IDA  the  template  on  which 
many  other  libraries  had  been  generated  (see  Section  3),  but 
it  constituted  a  unique  five-atom  linker  bearing  an  additional 
functionalization  site  at  the  central  nitrogen  atom  of  its 
structure,  and  represented  an  unusual  head-to-head  as 
opposed  to  head-to-tail  linkage  of  the  polyamides.  Tire 
candidate  IDA-linked  pyrrole  polyamides  were  prepared 
with  solution-phase  synthesis  methodology  (Scheme  13)  and 
found  to  bind  in  a  unique  manner  exhibiting  cooperative 
extended  2:1  binding  across  a  9-10-bp  site  in  what  appears  to 
be  the  first  well-characterized  example  of  parallel,  side-by- 
side  binding  of  such  polyamides  (Figure  22). 1841 

5.3.  LEF-1  /P-Catenin 

A  second  class  of  inhibitors  of  LEF-l/fS-catenin  mediated 
gene  transcription  (see  Section  4.4  for  the  first  class)  were 
derived  from  the  libraries  of  DNA  minor-groove-binding 


ligands.1871  The  unpublished  results  of  the  screening  of  one 
such  library  are  shown  in  Figure  23  resulting  in  the  identi¬ 
fication  of  two  closely  related  lead  structures  69  and  70  that 
inhibit  TOPFLASH  transcription  10-fold,  exhibit  a  s  three¬ 
fold  TOPFLASH-FOPFLASH  selectivity,  inhibit  LEF-1 
DNA  binding  in  a  gel-shift  assay,  and  inhibit  the  (3-catenin 
induced  transformation  of  the  colon  cancer  cell  line  SW480. 
Their  DNA-binding  selectivities  were  established  in  our  FID 
assay.1871  This  constitutes  only  the  second  example  of  a  small- 
molecule  inhibitor  of  LEF-l/DNA  binding  (protein-DNA 
interaction)  leading  to  inhibition  of  gene  transcription,  the 
first  being  a  polyamide  from  Dervan  et  al.1366  3671 


Natural  product  leads  for  existing  or  new  targets  often 
emerge  as  superb  starting  points  for  drug  development.  The 
merits  of  enlisting  a  natural  product  lead  have  been  chroni¬ 
cled  in  many  reviews  and  monographs,13681  and  may  be 
attributed  to  several  features.  They  have  emerged  from  a 
biological  milieu  in  which  they  express  their  properties 
ensuring  that  they  embody  many  drug-like  qualities.  No 
doubt,  it  is  the  integration  of  many  functions  into  a  single 
compact  structure  that  distinguishes  natural  product  leads 
from  those  created  in  the  laboratory.  As  the  exploration  of  the 
properties  of  complex  natural  products  becomes  increasingly 
more  sophisticated  with  the  technological  advances  being 
made  in  their  screening  and  evaluation  and  as  structural 
details  of  their  interaction  with  biological  targets  becomes 
more  accessible,  the  opportunities  for  providing  unique 
solutions  to  complex  biological  problems  have  grown.  A 
powerful  complement  to  the  examination  of  the  naturally 
derived  agents  themselves  is  the  preparation  and  subsequent 
examination  of  key  partial  structures,  and  agents  containing 
deep-seated  structural  modifications.  Well  conceived  deep- 
seated  structural  modifications  may  be  used  to  address  the 
structural  basis  of  the  natural  product’s  interactions  with 
biological  targets  and  the  define  fundamental  relationships 
between  structure,  functional  reactivity,  and  properties.  In 
these  studies,  the  challenging  problem  is  to  understand  the 
beautiful  solutions  and  subtle  design  elements  that  nature  has 
provided  in  the  form  of  a  natural  product  and  work  to  extend 
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Systematically  van/  the  length  of  the  AT-rich  binding  site 
5'-GCAAAAAC  A  A. 


3'-CGTTTTTG  A  A 


NH(Py)3COzMe 

HN 

CO(Py)3NHDba 


A  5A 


12A 


5-GCAAAAAAAAAAAAC' 
3'-CGT  I  i  I  I  11  IT!  TTG 


A  A 

/ 

A  A 
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OA, 

>  -OMe 
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NH 


O 
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NH 
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Figure  n.  Hairpin  binding  of  a  substituted  P-alanine  linked  polyamide.  Bottom  left:  P-ala- 
nine  extended  antiparallel  2:1  side-by-side  binding.  Bottom  right:  p^^'-alanine:  antiparallel 
1:1  side-by-side  hairpin  binding. 


the  solution  through  rational  design  elements  to  provide  more 
selective,  more  efficacious,  or  more  potent  agents  designed 
specifically  for  the  problem  or  target  under  investigation. 

Still  today,  nine  of  top-20  best-selling  drugs  were  derived 
or  developed  from  natural  product  leads  with  combined  sales 
of  >  $16  billion  (1999).14071  Consequently,  natural  products  are 
arguably  one  of  the  most  successful  approaches  to  modern 
drug  discovery,  and  it  is  not  surprising  that  combinatorial 
methods  have  been  evolved  to  exploit  the  importance  of  such 
leads  (Table  7)J36'M071  The  solution-phase  synthesis  approach 
we  developed  has  proven  successful  for  the  generation  of 
libraries  of  selected  natural  products.  In  addition  to  the 


synthesis  of  2640  analogues  of  distamycin  (Sec¬ 
tion  5.1.1),  132  analogues  of  CC-1065  and  the  duo- 
carmycins  (Section  5.1.2),  and  a  prototype  analogue 
of  triostin  A  (Section  5.1.3),  the  approach  was  used 
to  create  a  library  of  HUN-7293  analogues  exploring 
each  structural  detail  of  the  natural  product.  This 
example  is  one  in  which  the  biological  target  for  the 
natural  product  has  not  been  established.  Even  in 
the  absence  of  a  known  target,  a  functional  screen 
and  a  library  of  analogues  was  used  to  define  the 
pharmacophore  of  the  natural  product. 


6.1.  HUN-7293:  Single-Point  Changes  Introduced  by 
Total  Synthesis  of  a  Library  of  Analogues:  A 
“Chemical”  Mutagenesis  Approach  to  Defining 
Structure-Function  Relationships 

The  naturally  occurring  cyclic  heptadepsipeptide 
HUN-7293  (Scheme  14)  was  first  isolated  in  1992 
from  a  fungal  broth  in  a  screen  for  inhibitors  of 
inducible  cell  adhesion  molecule  expression)4081  Its 
structure  was  subsequently  established  by  'H  NMR 
spectroscopic  methods  and  confirmed  by  X-ray 
analysis.14081  Independently,  the  same  cyclic  depsi- 
peptide  was  isolated  by  a  Japanese  group  from  a 
different  fungal  species  based  on  a  screen  for  anti- 
HIV  compounds. 1401,1  Molecules  that  bind  with  endo¬ 
thelial  cells,  which  include  intercellular  adhesion 
molecule  1  (ICAM-1),  vascular  cell  adhesion  mole¬ 
cule  1  (VCAM-1),  and  E-selectin  play  important 
roles  in  the  immune  response  by  recruiting  and 
regulating  leukocyte  migration  and  cell-to-cell  inter¬ 
actions  at  a  site  of  inflammation.  Modulation  of 
these  interactions  through  the  inhibition  of  such  cell 
adhesion  molecule  expression  should  prove  useful 
for  the  treatment  of  chronic  inflammatory  and 
autoimmune  diseases  characterized  by  their  over¬ 
expression  or  upregulation.  As  detailed  in  recent 
studies,14101  diverse  signals  act  on  endothelial  cells  to 
activate  members  of  the  nuclear  factor  of  the 
Kappa  B  (NF-xB)  transcription  factor  family.  NF- 
xB  family  members  are  cytoplasmic  homo-  or 
heterodimer  proteins  inactive  when  complexed 
with  members  of  a  family  of  inhibitor  proteins, 
IxB.  Phosphorylation  of  IxB,  releasing  the  active 
NF-xB.  results  in  translocation  to  the  nucleus  and 
initiation  of  transcription  of  its  targeted  genes  including  the 
cell  surface  adhesion  molecules.  Few  other  groups  have 
detailed  work  on  small-molecule  inhibitors  of  the  expres¬ 
sion1415^151  of  cell  surface  adhesion  molecules,  aside  from 
antisense141'1  and  antibody14121  approaches,  and  unlike  HUN- 
7293,  each  target  the  NF-xB  signal  transduction  pathway. 

The  first  total  synthesis  of  HUN-7293  was  disclosed  using 
a  convergent  approach  which  served  to  define  several  subtle 
elements  essential  to  its  preparation  including  backbone  ester 
formation  by  a  Mitsunobu  displacement  and  the  identification 
of  MLEU3-LEU4  as  an  effective  macrocyclization  site 
(Scheme  14).174]  The  subsequent  implementation  of  this 
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Scheme  12.  The  parallel  synthesis  of  p-alanine  linked  pyrrole  polya¬ 
mides.  Py=  N-methy  I  pyrrole,  Im  =  N-methylimidazole,  HOBt  - 1- 
hydroxybenzotriazole,  TFA=trifluoroacetic  acid,  Alloc  =  allyloxycar- 
bonyl. 
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Scheme  13.  Synthesis  of  pyrrole  polyamides  linked  with  an  IDA  scaffold. 


Figure  22.  Extended  2:1  parallel  side-by-side  binding  of  IDA-linked  pol¬ 
yamides.  The  best  results  are  obtained  with  R=  (CH2)2NHBoc. 


approach  in  the  convergent  parallel  synthesis  of  a  library  of 
key  analogues  of  HUN-7293  was  used  to  define  in  detail  the 
structure-function  relationships  of  the  natural  product 
(Scheme  15).'75'76' 

A  solution-phase  approach  to  the  simultaneous  prepara¬ 
tion  of  the  library  was  utilized  enlisting  simple  acid-base 
liquid-liquid  extractions  for  isolation  and  purification  of  the 
synthetic  intermediates  and  final  products  (Scheme  16). 
Notably,  this  approach  provided  each  intermediate  and  final 
product  sufficiently  pure  for  subsequent  use  even  in  the 
multistep  synthesis  of  HUN-7293  and  its  structurally  chal¬ 
lenging  analogues.  The  library  included  an  alanine  scan 
of  the  seven  residues,  an  N-methyl  deletion  of  the  three 
/V-methyl  residues,  and  fundamental  simplifications  in 
each  of  the  nonstandard  amino  acid  side  chains  allowing 
the  identification  of  key  residues  and  structure  features 
contributing  to  the  biological  properties.  The  simulta¬ 
neous  inclusion  of  simplified  analogues  of  the  non¬ 
standard  amino  acid  side  chains  permitted  the  identi¬ 
fication  of  not  only  essential  structural  features,  but  also 
non-obvious  and  subtle  structural  features. 

Alanine-scanning  mutagenesis  is  an  informative 
method  for  defining  epitopes  in  proteins.1416'  By  replac¬ 
ing  each  amino  acid  side  chain  with  a  methyl  group  while 
retaining  the  stereochemistry  of  the  peptide  backbone, 
the  significance  of  each  individual  side  chain  can  be 
identified.  Although  this  approach  has  its  limitations,  it 
has  been  used  to  provide  a  general  insight  into  structur¬ 
e-function  relationships  of  a  protein,  especially  when 
the  functional  mechanism  is  unknown  or  its  3D  structure 
is  unavailable.'416'  Similarly,  in  the  absence  of  a  defined 
target,  the  importance  of  each  residue  of  HUN-7293 
could  not  be  anticipated.  Along  with  additional  mod¬ 
ifications  and  simplifications  in  each  of  the  individual 
side  chains,  an  alanine  scan  of  the  structure  was 
conducted.  The  results  of  their  examination  are  sum- 
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BocNHX-CONH  8CONH-J-CO,Fr 
BocNH  X-CONH  9-CONH  3-COjEt 
BosNH  X-CONH  1 1-  CONH-3-CO^Et 


Strong  disruption  No  growth  j 

No  disruption  Moderate  growth 

No  disruption  Good  growth 

Moderate  disruption  Good  growth 


Figure  23.  Disruption  of  LEF-1  DNA  binding  and  inhibition  of  LEF-l/p- 
catenin-mediated  gene  transcription.  The  lead  structures  69  and  70 
have  a  tenfold  transcription  repression  (TOPFLASH)  and  threefold 
selectivity  (TOPFIASH/FOPFLASH) 


marized  in  Table  8  and  they  provided  an  initial,  albeit 
surprisingly  incomplete,  overview  of  the  relative  importance 
of  each  residue.  With  the  exception  of  residue  7,  which 
already  contains  a  methyl  group  in  the  natural  product  (R7  = 
Me),  the  replacement  of  each  side  chain  with  a  methyl  group 
led  to  a  reduction  in  potency.  However,  the  magnitude  of  this 
loss  in  activity  varied  over  a  1000-fold  range  with  the  R1 
replacement  being  only  about  two-times  lower,  the  R2-R4 
replacements  leading  to  10-100-fold  reductions,  and  the  R5 
and  R6  replacements  leading  to  even  larger,  over  100-1000- 
fold,  reductions  with  R5  being  by  far  the  most  significant 
residue.  Just  as  interesting  and  despite  the  potency  reductions, 
the  methyl  substitutions  for  the  R'-R4  side  chains  enhanced 
the  VCAM-1  versus  ICAM-1  selectivity  indicating  that  the 


Table  7:  Natural  product  libraries. 


Natural  product 

Library  format 

Size 

Reference 

Lavendustin  A 

Solid  Phase 

61 

[370] 

Balanol 

Solid  Phase 

32 

[371] 

Olomoucine 

Solution  and  Solid 
Phase 

364 

[372] 

Rauwolfa  Alkaloids 

Solid  Phase 

792 

[373] 

Epothilones 

Solid  Phase 

180 

[374] 

Prostaglandin  E2  and 

f2„ 

Soluble  Polymer  Sup¬ 
port 

2/16/6 

[375] 

Taxoids 

Solid  Phase 

400 

[376] 

Oscillamide  Y 

Solid  Phase 

4 

[377[ 

(S)-Zearalenone 

Solid  Phase 

1 

[378] 

Cycloserine 

Solid  Phase 

80 

[379] 

Flavones 

Solution  Phase 

36 

O 

OO 

Muscone 

Solid  Phase 

12 

[381] 

Sarcodictyin 

Solution  Phase 

15/53 

[382] 

Taxoids 

Solution  Phase 

26 

[383] 

Prostaglandin  E  and  F 

Solid  Phase 

26 

[384] 

Fumitremorgin  C 

Solid  Phase 

42 

[385[ 

Indolactam  V 

Solid  Phase 

31 

[386] 

Vitamin  D3 

Solid  Phase 

1 

[387] 

Kramerixin 

Solution  Phase 

120 

[388[ 

Vancomycin 

Solution  and  Solid 
Phase 

39,304 

[389] 

HUN-7293 

Solution  Phase 

1/40 

[75,76] 

Carpanone 

Solid  Phase 

6 

[390] 

Distamycin  A 

Solution  Phase 

2640 

[87] 

Distamycin  A 

Positional  Scanning 

1000 

[86] 

Triostin  A 

Solution  Phase 

1 

[72] 

Fumiquinazoline 

Solid  Phase 

27 

[391] 

Benzopyrans 

Solution  and  Solid 
Phase 

>10000 

[392] 

Steroids 

Solid  Phase 

20 

[393] 

Curacin  A 

Solution  Phase 

18 

[394] 

Stipiamide 

Solution  Phase 

42 

[395] 

CC-1065 

Solution  Phase 

132 

[88] 

Galanthamine 

Solid  Phase 

2527 

[397] 

Mappicine 

Solution  Phase 

112/560 

[396] 

Tambjamine 

Solution  Phase 

10 

[398] 

Vitamin  D3 

Solid  Phase 

72 

[399] 

Vitamin  D3 

Solution  and  Solid 
Phase 

5 

[400] 

Psammaplin  A 

Solution  Phase 

3828 

[401] 

Hapalosin 

Solution  and  Solid 
Phase 

6 

[402] 

Nakijiquinones 

Solution  Phase 

56 

[403] 

Steroids 

Solution  Phase 

298 

[404] 

Ceramide 

Solid  Phase 

528 

[405] 

Dysidolide 

Solid  Phase 

8 

[406] 

Natural  product-like 

[369] 

(review) 

changes  in  this  half  of  the  molecule  affected  inhibition  of 
ICAM-1  expression  more  significantly,  whereas  this  selectiv¬ 
ity  was  lost  along  with  the  activity  with  methyl  substitutions  at 
R5  and  R6  with  the  change  at  R5  again  being  especially 
detrimental.  From  these  results  alone,  one  could  anticipate 
that  substantial  changes  in  the  R1  side  chain  could  be 
tolerated  including  efforts  to  enhance  selectivity  or  physi- 
ochemical,  distribution,  stability,  and  toxicity  profiles, 
whereas  changes  to  R2-R4  might  prove  more  challenging, 
and  those  to  R5  (a  complete  set  of  modifications  of  R7  is 
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Table  8:  Alanine  scan  of  HUN-7293.  Inhibitory  activity  (ICSo  [nM])  for 
VCAM-1  and  ICAM-1  protein  expression  in  human  microvascular  cell  line 
(HMEC-1).1*1 


R6  R7  R' 


HUN-7293|bl  1  24  24  1 

79,  R'  =  Me  2.3 ±0.5  178±18  77  0.4 

80,  R7  =  Me  100  ±34  6300  ±370  63  0.01 

81,  R’  =  Me  56±11  4100±2500  73  0.02 

82,  R4  =  Me  1 6 ±  1 .4  700±20  43  0.06 

83,  R5  =  Me  1 800 ±10  2800±210  1.6  0.0006 

84,  R6=Me  270 ±40  1400±300  5.2  0.004 


[a]  Unless  otherwise  indicated,  the  substituents  are  those  found  in  HUN- 
7293.  [b]  R7  =  Me,  HUN-7293. 


presented  in  Table  9)  and  R6  are  likely  to  be  especially 
detrimental.  Consequently,  the  results  of  the  more  subtle  side- 
chain  modifications  proved  especially  valuable  at  addressing 
inaccuracies  in  interpreting  such  an  incomplete  data  set  from 
the  alanine  scan. 

An  assessment  of  the  impact  of  each  residue,  each  side 
chain,  and  each  structural  feature  of  HUN-7293  was  estab¬ 
lished  enlisting  a  complete  library  of  analogues  composed  of 
systematic  single-point  changes  in  the  structure  prepared  by 
parallel  solution-phase  synthesis.  Over  40  new  compounds 
were  assembled  by  superimposing  a  divergent  synthetic 
strategy  onto  a  convergent  total  synthesis.  The  combination 
of  the  beneficial  attributes  of  a  conventional  convergent 


prepare  all  22  tetrapeptide  analogues 

Scheme  14.  Parallel  synthesis  of  a  library  of  HUN-7293  analogues: 
divergent  preparation  of  a  library  by  using  a  convergent  total  synthesis. 
All  the  intermediates  and  products  were  isolated  and  purified  by 
liquid-liquid  acid-base  extraction.  ©~®  indicate  disconnections. 


synthetic  strategy  with  the  divergent  preparation  of  a  series  of 
structurally  related  analogues  can  only  be  conducted  using 
solution-phase,  not  solid-phase,  techniques  and  avoids  the 
more  repetitive  independent  linear  synthesis  of  each  ana¬ 
logue.  This  generalized  library  approach  of  chemical  muta¬ 
genesis  of  the  cyclic  depsipeptide  provided  a  detailed  first 
level  structure-activity  study  that  subsequent  optimization 
efforts  can  be  confidently  based  on.  One  of  the  most  striking 
conclusions  of  the  study  was  that  a  simple  alanine  scan  of  the 
naturally  occurring  depsipeptide  may  provide  a  sense  of  the 


Scheme  15.  Construction  of  HUN-7293  pharmacophore  library  via  solution-phase  parallel  synthesis;  X  =  0,  NH;  Y  =  H,  Me;  Bn  =  benzyl, 
Cbz  =  benzyloxycarbonyl.  For  more  details  see  text. 
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Me  O 

-ci*h2NxA0R,  + 

R! 

R'  =  CH3,  R!  =  CH3(71) 

R’  =  fill,  R2  =  CH2CH(CH3)2(72) 


73 


a 


HNBoc  EDCI,  HOAt 

C°2H  NaHCOj 
from  citronellal 


HNBoc  Me  O 

‘  N.X  , 


OR1 

O  Rz 

R1  =  CH3,  R2  =  CH3(74r  84%) 

R1  =  fBu,  R2  =  CH2CH(CH3)2  (75,  91%) 


1 —  R'  =  CH3 
LiOH[^Rl  =  OH 


•< 


1 )  HCI,  EtOAc 

2)  EDCI.  HOAt,  NaHC03 
83%  (2  steps) 

HNBoc  Me 

N.  ^CO,H 


O  R 

76  65%  library 

1) HC02H 

2)  EDCI,  HOAt,  NaHC03 
65%  (2  steps) 


HOv^/CN 


tripeptide  5 


HO' 


OH  'S  (from  L-GIn) 


Scheme  16.  Synthesis  of  HUN-7293  and  analogues.  A  Mitsunobu  esterification  permits  the  use  of  the 
L-Gln-derived  alcohol  precursor  77  and  avoids  the  racemization  observed  with  activation  of  the  NMe- 
Ala  C-terminus.  With  78  epimerization  upon  macrolactamization  is  minimized  with  DPPA  (diphenyl- 
phosphorylazide).  HOAt  =  l-hydroxy-7-azabenzotriazole,  DIAD  =  diisopropyl  azodicarboxylate. 


importance  of  each  residue,  but  it  provides  an  incomplete 
data  set  that  is  problematic  to  draw  substantive  conclusions 
from  since  each  nonstandard  residue  or  side  chain,  including 
residues  5  and  6,  could  be  simplified  without  substantially 
influencing  the  potency.  Although  similar  approaches  have 
been  disclosed  to  optimize  or  explore  a  single  residue  within  a 
naturally  occurring  cyclic  peptide,  we  are  not  aware  of  an 
instance  where  all  the  residues  were  examined  simultane¬ 
ously.  Moreover,  a  more  common  approach  to  initial  struc¬ 
ture-activity  relationship  studies  of  such  natural  products  has 
been  systematic  derivatization  or  degradation  studies  to 
establish  the  structural  features  important  for  activity.  These 
efforts,  like  an  alanine  scan,  provide  an  incomplete  data  set  on 
which  to  base  further  studies. 

7.  Summary  and  Outlook 

The  complementary  techniques  of  solid-phase  and  solu¬ 
tion-phase  combinatorial  chemistry  provide  rare  opportuni¬ 
ties  to  rapidly  probe  challenging  targets,  especially  those  for 
which  no  known  small-molecule  leads  exist  and  for  which 
there  is  little  or  no  structural  information  available.  Split-and- 


mix  solid-phase  synthesis  and  sol¬ 
ution-phase  mixture  synthesis 
have  substantially  increased  the 
capacity  for  preparing  candidate 
libraries  and  their  value  has 
increased  as  deconvolution  techni¬ 
ques  for  rapidly  or  easily  identify¬ 
ing  screening  leads  have  been 
introduced.  Among  its  many 
attributes,  solution-phase  library 
synthesis  permits  the  utilization 
of  the  widely  accepted  convergent 
synthesis  strategy,  easily  accom¬ 
modates  mixture  synthesis,  is 
adaptable  to  dynamic-  or  selec¬ 
tion-screening  including  target- 
assembled  ligands,  and  is  generally 
not  limited  by  scale.  Moreover,  the 
advances  in  soluble  polymer-sup- 
ported  synthesis  and  the  wide¬ 
spread  success  of  strategies  enlist¬ 
ing  polymer-supported  reagents, 
scavengers,  and  workup  reagents 
(inverse  solid-phase  synthesis)11651 
suggest  that  the  common  assump¬ 
tion  that  library  synthesis  is  best 
conducted  on  solid  phase  may  not 
always  be  accurate  and  that  the 
value  of  solution-phase  library 
synthesis  may  still  be  underesti¬ 
mated. 

Our  present  library  of  approx¬ 
imately  40000  compounds  is  in  the 
process  of  being  expanded  to 
1000000  compounds  with  a  focus 
on  assembling  a  series  of  libraries 
capable  of  adequately,  if  not  fully,  exploring  pharmacophore 
space  in  a  single  round  of  screening,  incorporating  specific 
design  elements  for  targeting  protein-protein  interactions 
(e.g.  RGD  mimetics,  a-helix  and  p-shect  mimetics),  exploring 
new  templates,  and  implementing  new  solution-phase  techni¬ 
ques  including  target-assisted  synthesis  for  selection  of 
candidate  ligands.  The  fact  that  the  libraries  are  generated 
on  a  scale  permitting  their  repeated  use  in  new  assays  as  the 
opportunities  arise  ensures  their  value  will  only  increase  over 
time. 

It  is  likely  that  the  years  ahead  will  be  characterized  by  an 
increasing  focus  on  the  application  of  the  technology  more 
than  the  methodology  itself,  and  this  may  very  well  be  the  era 
in  which  the  most  useful  approaches  naturally  emerge.  It  will 
also  mark  a  time  where  the  problem  selection  may  distinguish 
the  notable  contributions  from  those  that  are  overlooked.  The 
future  issue  may  well  be  not  just  the  identification  of  effective 
modulators  (inhibitors  or  mimics)  of  target  protein-protein  or 
protein-DNA  interactions,  but  the  identification  of  selective 
modulators.  A  first  step  in  such  an  analysis  for  protein-DNA 
interactions  was  taken  with  the  introduction  of  the  FID  assay 
which  permits  the  simultaneous  examination  of  all  possible 
DNA  sequences.  This  allows  an  assessment  of  affinity  for  not 
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Table  g:  R5  modification  of  HUN-7293.  Inhibitory  activity  (ICS0  [rM])  for 
VCAM-1  and  ICAM-1  protein  expression  in  human  microvascular  cell  line 
(HMEC-l).1"1 


90,  R  =  CH2Ph 


R>T  94,  R  =  Cbz 

OMe  95,  R  =  H 


Compound 

VCAM-1 

ICAM-1 

Selectivity 

Rel. 

potency 

HUN-7293 

1 

24 

24 

i 

85,  Y5  =  H 

>10000 

>10000 

n.d.^ 

<0.0001 

86 

1 800  ± 1 0 

2800  ±210 

1.6 

0.0006 

87 

44±11 

5 1 0  ± 1 40 

12 

0.02 

88 

1.7±1.6 

1 9  ±  1 .2 

u 

0.6 

89 

21  da  1.6 

186  ±3 

9 

0.05 

90 

19±1.2 

16±13 

0.8 

0.05 

91 

38±5 

21 00  ±170 

55 

0.03 

92 

11  ±2.6 

71  ±35 

6 

0.09 

93 

29  ±6 

94  ±13 

3 

0.03 

94 

380±  1 20 

890  ±340 

2.3 

0.003 

95 

1400  ±230 

>7000 

>5 

0.0007 

96 

2.3  ±0.6 

43  ±6 

19 

0.4 

97 

63  ±4 

720  ±90 

11 

0.02 

CjJ-epi-97 

190  ±20 

9700  ±3300 

51 

0.005 

98 

81 0  ± 1 30 

8200  ±120 

10 

0.001 

99 

280  ±7 

480  ±65 

1.7 

0.004 

C2!-epi-99 

1 1 00  ±  240 

2600  ±440 

2.4 

0.0009 

[a]Ys  =  Me  (HUN-7293  unless  otherwise  stated).  [b]n.d.  =  not  deter¬ 
mined. 


only  the  desired  target  (sequence),  but  also  a  simultaneous 
assessment  of  selectivity  among  all  targets  (sequences)  that 
may  be  used  to  distinguish  between  otherwise  similar  leads. 
Proteomics  approaches  towards  examining  ligand  interac¬ 
tions  with  all  proteins  are  emerging14171  that  may  be  used  to 
establish  ligand  selectivity,1”1  and  analogous  approaches  to 
simultaneous  or  parallel  screening  for  modulating  protein- 
protein  interactions14181  are  on  the  horizon. 
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The  synthesis  of  the  alkylating  subunit  of  the  DNA  cross-linking  agent,  isochrysohermidin  (2), 
and  its  subsequent  incorporation  into  conjugates  with  distamycin  A  (1)  are  described.  The  DNA 
binding  properties  of  these  agents  were  compared  to  that  of  distamycin  A,  using  a  fluorescence 
intercalator  displacement  (FID)  assay. 


Introduction 

Gene  expression  is  regulated  by  a  host  of  inhibitor  and 
enhancer  proteins  that  selectively  bind  to  specific  se¬ 
quences  of  DNA.  The  selective  disruption  of  this  process 
by  small  molecules,  which  bind  to  DNA  in  a  sequence- 
specific  manner,  may  provide  access  to  new  therapeutics. 
Among  this  class  of  agents,  distamycin  A  is  one  of  the 
most  widely  studied.  Distamycin  A,  originally  isolated 
from  Streptomyces  sp.,1  is  a  minor  groove  binding  agent 
with  sequence  specificity  toward  A— T  rich  sites  within 
duplex  DNA.  Its  sequence  specificity  and  high  affinity  is 
derived  from  a  combination  of  interactions  including 
hydrogen  bonding,  van  der  Waals  contacts,  and  electro¬ 
static  interactions  of  the  cationic  amidine  side  chain  with 
the  phosphate  backbone  of  DNA.2  The  more  recent 
discovery  of  2:1  complexes,3  their  elaboration  into  side- 
by-side  antiparallel  y  hairpin  polyamides,  and  the  advent 
of  Dervan’s  pairing  rules  with  the  template  modifications 
to  selectively  recognize  G  (Im  vs  Py)  or  A  (Hp  vs  Py) 
provides  a  powerful  paradigm  on  which  to  design  se¬ 
quence-selective  DNA  binding  agents.4  A  number  of 
studies  have  examined  the  consequences  of  incorporating 
nonselective  alkylating  agents  into  the  distamycin  struc¬ 
ture  (e.g.  a-haloacylamides,  nitrogen  mustards).5  A  more 
limited  series  of  studies  have  examined  conjugates  with 


(1)  Arcamone,  F.;  Penco,  S.;  Orezzi,  P.  G.;  Nicolella,  V.;  Pirelli.  A. 
Nature  1964.  203,  1064. 

(2)  Kopka,  M.  L.;  Yoon,  C.;  Goodsell,  D.;  Pjura,  P.;  Dickerson,  R.  E. 
Proc.  Natl.  Acad  Sci.  U.S.A.  1985,  82,  1376. 

(3)  Pelton,  J.  G.;  Wemmer,  D.  E.  J.  Am.  Chem.  Soc.  1990,  112,  1393. 

(4)  Dervan,  P.  B.  Bioorg.  Med.  Chem.  2001,  9,  2215. 

(5)  For  recent  examples,  see  a-Haloacylamides:  (a)  Cozzi,  P.;  Beria, 
I.;  Caldarelli,  M.;  Capolongo,  L.;  Geroni,  C.;  Mongelli,  N.  Bioorg.  Med. 
Chem.  Lett .  2000,  10,  1269.  (b)  Cozzi,  P.  Farmaco  2001,  56,  57.  (c) 
Cozzi,  P.;  Mongelli,  N.  Curr.  Pharm.  Des.  1998,  4,  181.  (d)  Krowicki, 
K.;  Balzarini,  J.;  De  Clerq,  E.;  Newman,  R.  A.;  Lown,  J.  W.  J.  Med. 
Chem.  1988,  31,  341.  (e)  Baker,  B.  F.;  Dervan.  P.  B.  J.  Am.  Chem. 
Soc.  1985,  107,  8266.  Nitrogen  mustards:  (f)  Wang,  Y.;  Wright,  S.  C.; 
Larrick,  J.  W.  Bioorg.  Med.  Chem.  Lett.  2003,  13.  459.  (g)  Baraldi,  P. 
G.;  Romagnoli,  R.;  Guadix,  A.  E.;  Pineda  de  la  Infantas.  M.  J.;  Gallo. 
M.  A.;  Espinosa,  A.;  Martinez.  A.;  Bingham,  J.  P.;  Hartley,  J.  A.  J. 
Med.  Chem.  2002,  45,  3630.  (h)  Cozzi.  P.  C.;  Beria,  I.;  Caldarelli,  M.; 
Capolongo,  L.;  Geroni,  C.;  Mazzini,  S.;  Ragg.  E.  Bioorg.  Med.  Chem. 
Lett.  2000,  10.  1653.  (i)  Xie,  G.;  Gupta,  R.;  Lown,  J.  W.  Anti-Cancer 
Drug  Des.  1995,  10.  389.  (j)  Sigurdsson,  S.  T.;  Rink,  S.  M.;  Hopkins, 
P.  B.  J.  Am.  Chem.  Soc.  1993,  115,  12633.  (k)  Sondhi,  S.  M.;  Praveen- 
Reddy,  B.  S.;  Lown,  J.  W.  Curr.  Med.  Chem.  1997,  4,  313. 
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selective  alkylating  agents  (e.g.  CBI,6  duocarmycin  A7) 
that  combine  the  noncovalent  binding  selectivity  inherent 
in  the  distamycin  conjugate  with  the  alkylation  selectiv¬ 
ity  to  further  enhance  binding  selectivity  and  affinity. 

In  recent  studies,  we  described  the  total  synthesis  of 
isochrysohermidin  (2)  and  disclosed  the  first  report  of  its 
interstrand  DNA  cross-linking  properties.8  Isolated  from 
Mercurialis  perennis  L.,  both  d,l-  and  meso- forms  were 
found  to  occur  naturally  with  the  e/,/-diastereomer  un¬ 
ambiguously  identified  by  X-ray  crystallography.9  The 
dimeric  Af-methylcarbinolamides  undergo  a  slow  ring¬ 
opening  event  during  the  interconversion  of  d,l-  and 
meso-2.  This  ring-opening  reaction  exposes  an  electro¬ 
philic  carbonyl  capable  of  trapping  nucleophiles  within 
the  minor  groove  of  duplex  DNA.  The  only  nucleophile 
readily  accessible  to  minor  groove  bound  isochrysoher¬ 
midin  is  believed  to  be  the  C2  amine  of  guanine.  By 
incorporation  of  a  single  carbinolamide  subunit  of  iso¬ 
chrysohermidin  into  distamycin,  we  sought  to  establish 
whether  it  may  be  possible  to  direct  a  reversible  (vs 
irreversible)5"7  guanine  alkylation  near  adjacent  A— T 
rich  sites  within  duplex  DNA  (Figure  1). 

Results  and  Discussion 

The  distamycin  analogues  were  prepared  by  solution- 
phase  synthesis  requiring  only  acid/base  liquid-liquid 
extraction  protocols  for  the  isolation  and  purification  of 
the  distamycin  subunits.  The  amidine  side  chain  found 
in  the  natural  product  was  replaced  with  a  A/,A/-dim- 
ethylaminopropylamine  side  chain  to  facilitate  the  ease 
of  synthesis.10  This  substitution  is  well-documented  and 


(6)  (a)  Chang,  A.  Y.;  Dervan,  P.  B.  J.  Am.  Chem.  Soc.  2000,  122, 
4856.  (b)  Kumar  R.;  Lown,  J.  W.  Org.  Lett.  2002,  4,  1851.  (c)  Jia,  G. 

F. ;  Iida.  H.;  Lown,  J.  W.  Heterocycl.  Common.  1998,  4,  557.  (d)  Jia, 

G. ;  Iida,  H.;  Lown.  J.  W.  Chem.  Commun.  1999.  119.  (e)  Jia,  G.;  Iida, 

H. ;  Lown,  J.  W.  Synlett  2000,  603.  (f)  Boger,  D.  L.;  Schmitt,  H.;  Fink, 
B.  E.;  Hedrick,  M.  P.  J.  Org.  Chem.  2001,  66.  6654. 

(7)  Tao,  Z.-H.;  Fujiwara,  T.;  Saito,  I.;  Sugiyama,  H.  Angew.  Chem., 
Int.  Ed.  1999.  38,  650. 

(8)  Boger,  D.  L.;  Baldino,  C.  M.  J.  Am.  Chem.  Soc.  1993,  115,  11418. 

(9)  Matsui,  Y.;  Kawabe,  C.;  Matsumoto,  K.;  Abe,  K.;  Miwa,  T. 
Phytochemistry  1989,  25,  1470. 

(10)  Boger,  D.  L.;  Fink,  B.  E.;  Hedrick,  M.  P.  J.  Am.  Chem.  Soc. 
2000,  122  6382. 
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3:  n  =  2,  R  =  OMe 

4:  n  =  2,  R  =  HN(CH2)3N(Me)2 

5:  n  =  3,  R  =  OMe 

6:  n  =  3,  R  =  HN(CH2)3N{Me)2 


SCHEME  2 
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13 


n 

"N 

Me 


R  =  H2N(CH2)3N(Me)2  Me  u 
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SCHEME  3 


FIGURE  I.  Distamycin  A  (1),  isochrysohermidin  (2),  and 
hybrid  agents  (3—6). 
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does  not  adversely  affect  minor  groove  binding  affinity 
or  selectivity.  The  distamycin  subunits  10,  12,  13,  and 
15  were  prepared  as  previously  detailed10  by  coupling  7 
with  8  in  the  presence  of  EDCI  and  DMAP  to  afford  9  in 
90%  yield  (Scheme  1).  Subsequent  treatment  of  9  with 
anhydrous  4  M  HCI/EtOAc  followed  by  coupling  with  7, 
in  the  presence  of  EDCI/DMAP,  provided  the  distamycin 
core  11  in  86%  yield  as  detailed  in  our  prior  efforts. 
Subsequent  treatment  of  peptides  9  and  11  with  4  M  HCI/ 
EtOAc  removed  the  BOC  group  and  provided  the  corre¬ 
sponding  HCI  salt  of  the  methyl  ester  derivatives  of  the 
distamycin  subunits  (10  and  12). 

Distamycin  subunits  13  and  15,  incorporating  the  N,N- 
dimethylaminopropylamine  side  chain,  were  accessed 
from  9  as  previously  described.10  Saponification  with 
LiOH  was  followed  by  the  addition  of  /V,/V-dimethylami- 
nopropylamine  and  PyBOP  to  afford  the  corresponding 
adduct.  Subsequent  treatment  with  4  M  HCI/EtOAc 
provided  13  in  66%  yield  over  three  steps  (Scheme  2). 
The  tripeptide  14  was  prepared  by  coupling  dipeptide  13 


19 


.OMe 


J  V 

H02C-"^N^C02Me 


'02 


89% 


OMe 

rv0H 

C°2Me 


co2h 


co2h 


20 


21 


with  7,  in  the  presence  of  EDCI  and  DMAP,  and  provided 
the  corresponding  tripeptide  42%  yield.  Treatment  of  14 
with  4  M  HCI/EtOAc  removed  the  BOC  group  and 
provided  the  corresponding  HCI  salt  of  the  distamycin 
subunit  15  in  quantitative  yield. 

The  trisubstituted  pyrrole  precursor  to  the  isochryso¬ 
hermidin  alkylation  subunit  was  prepared  utilizing  a 
1,2,4,5-tetrazine  — ■  1,2-diazine  — *  pyrrole  Diels— Alder 
strategy  (Scheme  3). 11  An  inverse  electron  demand  Diels— 
Alder  reaction  of  1,2,4,5-tetrazine  1612  and  dimethoxy- 
ketene  acetal13  provided  the  corresponding  1,2-diazine 
1714  in  84%  yield.  A  subsequent  reductive  ring  contrac¬ 
tion  of  the  1,2-diazine  was  achieved  by  using  freshly 
activated  zinc  dust  in  glacial  acetic  acid  to  provide  the 
trisubstituted  pyrrole  1814  in  67%  yield.  The  four-carbon 
tether  was  installed  by  /V-alkylation  of  18  with  methyl 


(11)  For  reviews:  (a)  Boger.  D.  L.  Tetrahedron  1983,  39,  2869.  (b) 
Boger.  D.  L.  Chem.  Rev.  1986,  86,  781.  (c)  Boger,  D.  L.  Bull.  Soc.  Chim., 
Belg.  1990,  99,  599.  (d)  Boger,  D.  L.  Chemtracts:  Org.  Chem.  1996,  9, 
149.  (e)  Boger,  D.  L.;  Weinreb,  S.  M.  Hetero  Diels— Alder  Methodology 
in  Organic  Synthesis,  Academic:  San  Diego,  CA,  1987. 

(12)  Boger.  D.  L.;  Coleman.  R.  S.:  Panek,  J.  S.:  Huber,  F.  X.;  Sauer, 
J.  J.  Org.  Chem.  1985.  50,  5377. 

(13)  (a)  Corey,  E.  J.:  Bass,  J,  D.;  LeMahieu,  R.:  Mitra,  R.  B.  J.  Am. 
Chem.  Soc.  1964,  86,  5570.  (b)  Zheng,  Q.  H.:  Su,  J.  Synth.  Commun. 
1999.  29,  3467. 

(14)  (a)  Boger.  D.  L.:  Coleman,  R.  S.;  Panek,  J.  S.;  Yohannes,  D.  J. 
Org.  Chem.  1984,  49,  4405.  (b)  Boger,  D.  L.:  Patel,  M.  J.  Org.  Chem. 
1988,  53,  1405.  (c)  Boger,  D.  L.;  Baldino,  C.  M.  J.  Org.  Chem.  1991, 
56,  6942. 
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Synthesis  of  Isochrysohermidin- Distamycin  Hybrids 

SCHEME  4 

OMe 
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^C0^H  OMe 


13:  n  =  2,  R  =  HN(CH2)3N(Me)2  4:  n  =  2,  R  =  HN(CH2)3N(Me)2,  32% 

12:  n  =  3,  R  =  OMe  5:  n  «  3,  R  =  OMe,  37% 

15:  n  =  3,  R  =  HN(CH2)3N(Me)2  6:  n  =  3,  R  =  HN(CH2)3N(Me)2,  32% 

4-bromobutyrate  (K2CO3,  DMF)  to  furnish  19  in  95% 
yield.  Highly  selective  saponification  of  the  two  sterically 
and  electronically  more  accessible  methyl  esters  provided 
the  diacid  20  (99%).  Initially,  more  elaborate,  selective 
protections  were  anticipated  to  be  necessary  to  cleanly 
provide  20  (e.g.  use  of  benzyl  4-bromobutyrate).  However, 
such  selective  protections/deprotections  proved  unneces¬ 
sary  and  simple  treatment  of  19  with  2  equiv  of  LiOH  at 
25  °C  provided  20  in  superb  conversions  (99%).  In  the 
final  step,  a  [4+2]  cycloaddition  of  '02  across  the  pyrrole 
followed  by  a  low-temperature  oxidative  decarboxylation 
with  fragmentation  of  the  intermediate  endoperoxide 
afforded  the  isochrysohermidin  subunit  21  in  89%  yield. I4c 
In  initial  efforts,  the  '02  was  generated  photochemically 
in  the  presence  of  the  photosensitizer,  Rose  Bengal. 
However,  since  the  photosensitizer  was  difficult  to  com¬ 
pletely  remove  from  the  reaction  mixtures,  a  resin-bound 
form  of  Rose  Bengal  was  used,  which  was  found  to  effect 
the  desired  transformation  without  any  decrease  in 
reactivity  or  product  yields.15 

With  the  isochrysohermidin  subunit  21  in  hand,  the 
hybrid  conjugates  were  prepared  by  coupling  with  the 
distamycin  substructures  incorporating  either  two  or 
three  TV-methyl  pyrrole  subunits.  Accessing  isochryso¬ 
hermidin— dipyrrole  analogue  3  was  achieved  by  treat¬ 
ment  of  10  with  21  in  the  presence  of  EDCI/DMAP  to 
provide  3  in  31%  yield.  Similarly,  treatment  of  dipyrrole 
13  followed  by  addition  of  21  in  the  presence  of  EDCI/ 
DMAP  provided  4  in  32%  yield  after  purification.  The 
tripyrrole  conjugates  5  and  6  were  prepared  from  the 
corresponding  tripyrroles  12  and  15,  respectively.  The 
tripyrrole  conjugate  5  was  prepared  from  coupling  12  and 
21  in  the  presence  of  EDCI  and  T-Pr2NEt  to  provide  5  in 
37%  yield  after  purification  by  column  chromatography. 
Similarly,  the  tripyrrole  conjugate  incorporating  the  TV.TV- 
dimethylpropylamine  tail  (6)  was  obtained  from  EDCI 
and  T-Pr2NEt  mediated  coupling  of  15  and  21  to  provide 
6  in  32%  yield  (Scheme  4). 

DNA  Binding  Affinity.  The  DNA  binding  properties 
of  compounds  3-6  were  first  established  by  using  a 
fluorescence  intercalator  displacement  (FID)  assay  titra- 


(15)  Bernasconi,  C.;  Cottier,  L.;  Descotes,  G.;  Nigay,  H.;  Pardon,  J. 
C.;  Wisniewski,  A.  Bull.  Soc.  Chim.  Fr.  1984,  7,  11-323. 
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TABLE  1.  DNA  Binding  Affinity  of  3 
Distamycin  and  Its  Derivatives 

-6  Compared 

Agent 

Poly-d[A]-poly-d[T]  £ 
tf(xl06  M"1) 

CG A AAA AC  A  A 

G  CTTTTTG  A  A  A 

K  (xlO6  M-1) 

distamycin  A 

15.0 

17.0 

22 

15.9 

15.5 

23 

2.1 

6.4 

3 

~ 

6.3 

4 

— 

5.9 

5 

— 

8.9 

6 

~ 

13.8 

FIGURE  2.  Distamycin  analogues  utilized  in  the  FID  assay. 

tion  to  establish  a  binding  constant  (A)16  This  method 
is  based  on  the  loss  of  fluorescence  derived  from  the 
titration  displacement  of  ethidium  bromide  from  a  DNA 
of  interest.  The  agents  were  examined  for  their  ability 
to  bind  a  hairpin  deoxyoligonucleotide  containing  a 
central  five  base  pair  AT-rich  binding  site  (AAAAA) 
adjacent  to  capping  GC  base  pairs  relative  to  distamycin 
A  and  results  are  summarized  in  Table  1. 

Additionally,  the  binding  affinities  of  several  other 
distamycin  derivatives  (22  and  23,  Figure  2)  are  included 
for  comparison.  Distamycin  A  binds  to  poly-d[A]-poly-d[T] 
and  the  5'-AAAAA-3'  hairpin  deoxyoligonucleotide  with 
essentially  the  same  affinity.  Moreover,  replacement  of 
the  amidine  side  chain  on  distamycin  with  TV,TV-dimeth- 
ylpropylamine  (22)  simplifies  the  synthesis  and  does  not 
adversely  affect  binding  affinity.  By  contrast,  substitution 
at  the  TV-terminus  of  distamycin  analogues  has  more  of 
an  impact  on  DNA  binding  affinity.  Replacement  of  the 
TV-formyl  group  with  a  sterically  bulky  BOC  group  with 
23  lowers  the  binding  affinity  and  suggests  that  large 
substituents  at  the  TV-terminus  are  not  as  well  accom¬ 
modated  in  the  minor  groove.  Dipyrrole  hybrids  3  and  4 
show  a  3-fold  decrease  in  binding  affinity  for  the  hairpin 
deoxyoligonucleotide.  These  derivatives  possess  one  less 
pyrrole  subunit  than  distamycin  and  are  therefore  ex¬ 
pected  to  be  less  effective  noncovalent  DNA  binding 
agents.  Interestingly,  both  3  and  4  exhibited  an  affinity 
greater  than  expected  and  there  is  essentially  no  differ¬ 
ence  in  binding  affinity  between  3  and  4  potentially 
representative  of  a  DNA  alkylation  event.  Tripyrrole 
derivative  5  lacks  the  C-terminal  basic  side  chain  and 
has  half  the  binding  affinity  of  the  natural  product. 
However,  by  incorporating  three  TV- m e t hy  1  py r ro  1  e  sub¬ 
units  as  well  as  the  basic  side  chain  into  the  hybrid  6,  it 
is  possible  to  obtain  a  binding  affinity  close  to  that  of 
distamycin  and  its  closest  analogue  22.  Disappointingly, 
3-6  exhibited  no  time-dependent  increase  in  binding 
affinity  indicative  of  a  slow,  reversible  covalent  attach¬ 
ment  to  DNA.  Thus,  although  the  surprisingly  effective 


(16)  (a)  Boger,  D.  L.;  Fink,  B.  E.:  Brunette,  S.  R.;  Tse,  W.  C.;  Hedrick, 
M.  P.  J.  Am.  Chem.  Soc.  2001.  123.  5878.  (b)  Boger,  D.  L.;  Tse,  W.  C. 
Bioorg.  Med.  Chem.  2001,  9,  2511. 
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behavior  of  5  might  suggest  a  covalent  attachment  to 
DNA,  the  behavior  of  6  relative  to  22  along  with  the  lack 
of  time-dependent  binding  affinity  (data  not  shown) 
suggests  it  is  not  observed.  Although  it  is  possible  that 
the  covalent  attachment  is  rapidly  reversible,  the  intrin¬ 
sic  stability  of  the  carbinolamide  of  isochrysohermidin 
(ti/2  ca.  24—48  h,  DMSO)  suggests  that  is  also  unlikely. 
Thus,  although  we  do  not  yet  have  a  good  explanation 
for  the  surprising  behavior  of  3—5,  we  are  confident  that 
it  is  not  derived  from  a  stable,  slowly  reversible  covalent 
attachment  to  DNA.17 

Experimental  Section 

Dimethyl  3-Methoxy- 1  -  [  (3-methoxycarbonyl)propyl]  - 
l//-pyrrole-2,5-dicarboxylate  (19).  Methyl  4-bromobutyrate 
(309  fiL,  2.44  mmol)  was  added  to  a  solution  of  1814  (281  mg, 
1.32  mmol)  and  K2CO3  (455  mg,  3.29  mmol)  in  anhydrous  DMF 
(20  mL).  The  reaction  mixture  was  warmed  at  80  °C  and 
stirred  under  N2.  After  4  h,  the  reaction  mixture  was  cooled 
to  25  °C,  poured  into  H2O  (100  mL),  and  extracted  with  CH2- 
CI2  (4  x  100  mL).  The  combined  organic  layers  were  dried  (Na2- 
SO4),  filtered,  and  concentrated  under  vacuum.  Chromatog¬ 
raphy  (SiC>2,  50%  EtOAc— hexanes)  afforded  19  (399  mg,  96%) 
as  a  white  solid:  mp  74—75  °C;  'H  NMR  (CDCI3,  400  MHz)  <5 
6.52  (1H,  s),  4.81  (2H,  t,  J=  7.3  Hz),  3.85  (3H,  s),  3.82  (6H,  s), 
3.64  (3H,  s),  2.31  (2H,  t,  J=  7.6  Hz),  2.04  (2H,  quint,  J=  7.9 
Hz);  13C  NMR  (CDC13,  100  MHz)  5  173.2,  161.1,  160.6,  152.5, 

124.1,  112.8,  101.0,  57.9,  51.7,  51.5,  51.4,  45.4,  31.0,  26.7; 
MALDI-HRFTMS  m/z 336. 1056  (M  +  Na+,  Ci4H19N07  requires 
336.1054). 

1  -  (4-Butyric  acid)-4-methoxy-5-methoxycarbonyl- 1 H- 
pyrrole-2-carboxyIic  Acid  (20).  LiOH-HjO  (117  mg,  2.80 
mmol)  was  added  to  a  stirred  solution  of  19  (399  mg,  1.27 
mmol)  in  a  2:1:1  solution  of  THF:MeOH:H20  (8  mL).  After  20 
h,  the  mixture  was  partitioned  between  Et20  and  H2O.  The 
aqueous  layer  was  acidified  with  the  addition  of  5%  aqueous 
HC1  (pH  3.0)  and  extracted  with  EtOAc  (4  x  20  mL).  The 
combined  EtOAc  layers  were  dried  (Na2S04),  filtered,  and 
concentrated  under  vacuum  to  afford  diacid  20  (358  mg,  99%) 
as  a  white  solid:  mp  193-195  °C;  ‘H  NMR  (CDCb,  400  MHz) 
<5  6.61  (1H,  s),  4.79  (2H,  t,  J=  7.3  Hz),  3.81  (3H,  s),  3.79  (3H, 
s),  2.25  (2H,  t,  J=  7.4  Hz),  2.04  (2H,  quint,  J=  7.9  Hz);  13C 
NMR  (CDCI3,  100  MHz)  6  176.7,  163.1,  162.8,  154.2,  126.3, 

113.8,  102.5,  58.3,  51.6,  46.3,  31.9,  28.1;  MALDI-HRFTMS  mlz 
308.0744  (M  +  Na+,  Ci2H15N07  requires  308.0741). 

Isochrysohermidin  Subunit  21.  A  3:1  solution  of  CH3- 
CN-H2O  (40  mL)  was  added  to  a  quartz  flask  charged  with 

20  (23  mg,  0.08  mmol)  and  Rose  Bengal  resin  (7.0  mg,  0.0006 
mmol).18  The  solution  was  irradiated  under  a  Hanovia  high- 
pressure  mercury  lamp  (450  W)  through  a  uranium  yellow 
glass  filter  (transmits  <330  nm)  with  a  steady  stream  of  O2 
bubbled  through  the  solution.  After  3  h,  a  small  amount  of 
activated  charcoal  was  added  to  remove  any  solubilized  Rose 
Bengal  and  the  solution  filtered  through  Celite  and  rinsed  with 
MeOH.  The  solvent  was  concentrated  under  vacuum  to  afford 

21  (21.8  mg,  86%  yield)  as  a  transparent  glass:  ‘H  NMR 
(CDCI3,  400  MHz)  6  5.14  (1H,  s),  4.62  (1H,  s),  3.79  (3H,  s), 
3.77  (3H,  s),  3.68  (1H,  dt,  J=  14.4,  6.6  Hz),  3.08  (1H,  dt,  J= 
14.5,  6.6  Hz),  2.32  (2H,  t,  J-  7.2  Hz),  1.79  (2H,  quint,  J=  7.3 
Hz);  13C  NMR  (CDC13,  100  MHz)  6  172.8,  170.9,  170.8,  167.3, 

94.8,  63.6,  59.0,  53.4,  40.2,  25.2,  23.6;  MALDI-HRFTMS  mlz 
296.0728  (M  +  Na+,  C„H,5N07  requires  296.0741). 

Compound  3.  l-(3-Dimethylaminopropyl)-3-ethylcarbodi- 
imide  hydrochloride  (EDCI)  (33  mg,  0.17  mmol)  and  DMAP 
(11  mg,  0.08  mmol)  were  added  to  a  mixture  of  21  (12  mg, 


(17)  Cytotoxic  activity:  L1210IC5o  =  67  (1),  119  (3),  165  (4),  >200 
(5),  and  >200  ^/M  (6). 

(18)  Concentration  of  resin-bound  Rose  Bengal  was  determined 
according  to  ref  15  (0.09  mol  of  Rose  Bengal/g  of  support). 
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0.04  mmol)  and  10  (29  mg,  0.08  mmol)  in  anhydrous  DMF  (0.5 
mL)  and  the  reaction  mixture  was  stirred  at  0  °C  under  N2. 
After  16  h,  the  reaction  mixture  was  diluted  with  1:1  i-PrOH— 
CHCI3  (10  mL)  and  washed  with  10%  aqueous  NaHC03  (2  x 
20  mL).  The  organic  phase  was  dried  (Na2SC>4),  filtered,  and 
concentrated  under  vacuum.  Chromatography  (Si02,  15:1 
CHCL  MeOH)  afforded  3  (3.3  mg,  31%)  as  an  off-white 
syrup:  >H  NMR  (CDCI3,  400  MHz)  6  8.84  (1H,  br  s),  7.73  (1H, 
s),  7.41  (1H,  s),  7.15  (1H,  s),  6.77  (1H,  s),  6.67  (1H  s),  5.06  (1H, 
s),  3.90  (3H,  s),  3.89  (3H,  s),  3.85  (3H,  s),  3.82  (3H,  s),  3.81 
(3H,  s),  3.57  (1H),  2.36  (2H,  t,  J=  6.4  Hz);  13C  NMR  (CDC13, 
100  MHz)  6  175.6,  172.6,  165.0,  163.2,  161.5,  124.6,  123.9, 
123.4,  122.6,  121.0,  110.5,  106.1,  94.2,  59.8,  54.1,  51.6,  39.4, 

37.1,  37.0,  34.7,  26.5;  MALDI-HRFTMS  mlz  554.1862  (M  + 
Na+,  C^H^NsOgNa  requires  554.1857). 

Compound  4.  EDCI  (29  mg,  0.151  mmol)  was  added  to  a 
mixture  of  21  (10  mg,  0.038  mmol),  13  (29  mg,  0.076  mmol), 
and  DMAP  (18  mg,  0.151  mmol)  in  anhydrous  DMF  (0.3  mL) 
and  the  reaction  mixture  was  stirred  at  25  °C  under  N2.  After 
24  h,  the  reaction  mixture  was  diluted  with  1:1  i-PrOH-CHCl3 
(10  mL)  and  washed  with  H2O  (2x10  mL).  The  organic  phase 
was  dried  (Na2S04),  filtered,  and  concentrated  under  vacuum. 
Chromatography  (RPCis-PTLC,  4:1  MeOH— 50  mM  HCO2NH4 
buffer)  afforded  4  (7  mg,  32%)  as  a  clear  syrup:  ‘H  NMR  (2:1 
CD30D-CH2C12,  500  MHz)  6  8.44  (lH,s),  7.15  (1H,  d,  J=  1.8 
Hz),  7.11  (1H,  d,  J=  1.8  Hz),  6.87  (1H,  d,  J=  1.8  Hz),  6.83 
(1H,  d,  J=  1.8  Hz),  5.18  (1H,  s),  3.88  (3H,  s),  3.87  (3H,  s), 
3.85  (3H,  s),  3.78  (3H,  s),  3.40  (4H,  m),  3.20  (3H,  m),  2.88,  (6H, 
s),  2.32  (2H,  t,  J=  7.7  Hz),  1.97  (3H,  m);  13C  NMR  (2:1  CD3- 
0D-CH2C12,  125  MHz)  <5  177.0,  176.9,  172.5,  159.6,  126.9, 

123.3,  120.8,  120.5,  106.5,  105.9,  99.8,  94.0,  59.7,  56.6,  55.5, 

43.7,  39.3,  36.6,  34.5,  30.7,  27.4,  27.3,  26.6,  26.3;  MALDI- 
HRFTMS  mlz  602.2937  (M  +  H+,  C28H39N708  requires 
602.2933). 

Compound  5.  EDCI  (35  mg,  0.18  mmol)  was  added  to  a 
mixture  of  21  (12  mg,  0.04  mmol),  12  (39  mg,  0.09  mmol),  and 
i-Pr2NEt  (16  fiL,  0.09  mmol)  in  anhydrous  DMF  (0.5  mL).  The 
reaction  mixture  was  stirred  under  N2  at  0  °C  for  3  h  and 
allowed  to  warm  to  25  °C.  After  18  h,  the  reaction  mixture 
was  diluted  with  1:1  r-PrOH-CHCl3  (lOmL)  and  washed  with 
H20  (2  x  10  mL).  The  organic  phase  was  dried  (Na2SCh), 
filtered,  and  concentrated  under  vacuum.  Chromatography 
(Si02,  12:1  CHCls-MeOH)  afforded  5  (11  mg,  37%)  as  an  off- 
white  syrup:  'l  l  NMR  (1:1  CD3OD-CD2Cl2,  500  MHz)  <5  7.68 
(1H,  s),  7.34  (1H,  d,  J=  2.2  Hz),  7.18  (1H,  d,  J=  2.2  Hz),  7.11 
(1H,  d,  J=  1.8  Hz),  6.91  (1H,  d,  J=  2.2  Hz),  6.87  (1H,  d,  J  = 

2.2  Hz),  6.80  (1H,  d,  J=  1.8  Hz),  5.12  (1H,  s),  3.89  (3H,  s), 
3.88  (3H,  s),  3.87  (3H,  s),  3.84  (3H,  s),  3.78  (3H,s),  3.77  (3H, 
s),  3.45  (1H,  dt,  J=  14.5,  7.0  Hz),  3.16  (1H,  dt,  J=  14.5,  7.0 
Hz),  2.32  (2H,  t,  J=  7.5  Hz),  1.86  (2H,  m);  13C  NMR  (1:1,  CD3- 
0D-CD2C12,  125  MHz)  6  174.7,  173.3,  172.0,  169.1,  162.7, 

160.8,  124.2,  124.1,  123.2,  122.8,  122.6,  122.0,  120.5,  120.3, 

120.1,  109.8,  105.8,  105.3,  93.7,  88.6,  78.8,  59.4,  51.4,  38.9,  36.9, 

36.7,  34.1,  30.3,  26.0;  MALDI-HRFTMS  mlz  653.2459  (M+, 
C3oH3sN7Oio  requires  653.2440). 

Compound  6.  EDCI  (31  mg,  0.180  mmol)  was  added  to  a 
mixture  of  21  (11  mg,  0.04  mmol),  15  (39  mg.  0.08  mmol),  and 
/-Pr2NEt  (14  /<L,  0.090  mmol)  in  anhydrous  DMF  (0.5  mL). 
The  reaction  mixture  was  stirred  under  N2  at  0  °C  for  3  h  and 
allowed  to  warm  to  25  °C.  After  24  h,  the  reaction  mixture 
was  diluted  with  1:1  j-PrOH-CHCl3  (10  mL)  and  washed  with 
10%  aqueous  NaHC03  (2x10  mL).  The  organic  phase  was 
dried  (Na2SO,i),  filtered,  and  concentrated  under  vacuum. 
Chromatography  (RPC,S-PTLC,  6:1  MeOH-50  mM  HCO2NH4 
buffer)  afforded  4  (7  mg,  32%)  as  a  clear  syrup:  *H  NMR  (2: 1 
CD30D-CD2C12,  500  MHz)  <5  7.17  (1H,  br  s),  7.09  (1H,  s),  6.92 
(1H,  s),  6.86  (1H,  s),  6.91  (1H,  br  s),  6.82  (1H,  s),  6.79  (1H,  s), 
5.09  (1H,  s),  3.90  (3H,  s),  3.88  (12H,  br  s),  3.39  (4H,  br  t,  J  = 

6.3  Hz),  3.13  (2H,  m),  3.05  (2H,  br  t,  J=  7.7  Hz),  2.80  (6H,  br 
s),  2.31  (2H,  m),  1.96  (2H,  m),  1.83  (2H,  m),  0.877  (2H,  br  t,  J 
=  6.6  Hz);  13C  NMR  (1:1,  CD30D-CD2C12,  125  MHz)  d  173.0, 

170.3,  169.4,  168.1,  167.0,  164.9,  161.4,  124.6,  123.8,  123.4, 
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123.3,  120.9,  120.7,  120.5,  106.5,  106.4,  105.9,  56.7,  47.4,  47.3, 
43.8,  41.1,  37.1,  37.0,  36.7,  34.4,  30.7,  27.4,  27.3,  26.6,  25.3, 
23.7,  23.1;  MALDI-HRFTMS  iriz  723.3337  (M  +  H+,  C34H45N9O9 
requires  723.3340). 

Determination  of  DNA  Binding  Constants.  A  3-mL 
quartz  cuvette  was  loaded  with  Tris  buffer  (0. 1  M  Tris,  0.1M 
NaCl,  pH  8)  and  ethidium  bromide  (0.44  x  10-5  M  final 
concentration).  The  fluorescence  was  measured  (excitation  545 
nm,  emission  595  nm,  EtBr)  and  normalized  to  0%  relative 
fluorescence.  The  5'-AAAAA-3'  hairpin  deoxyoligonucleotide 
was  added  (1.5  /dVI,  12  /<M  in  base  pair  final  concentration), 
and  the  fluorescence  measured  again  and  normalized  to  100% 
relative  fluorescence.  A  solution  of  the  agent  (3  fih,  0.1  mM  in 
DMSO)  was  added,  and  the  fluorescence  measured  following 
5  min  of  incubation  at  23  °C.  Subsequent  addition  of  3-/<L 
aliquots  of  the  agent  was  continued  until  the  system  reached 
saturation  and  the  fluorescence  remained  constant  with  suc¬ 
cessive  compound  additions. 

Scatchard  Analysis  of  the  Titration  Curve.  The  AFwas 
plotted  versus  molar  equivalents  of  agent  and  the  AFM,  was 
determined  mathematically  by  solving  the  simultaneous  equa¬ 
tions  representing  the  pre-  and  postsaturation  regions  of  the 
titration  curve.  Utilizing  eqs  1-3,  a  Scatchard  plot  was 


generated  where  AF/[free  agent]  was  plotted  vs  A F.  The  slope 
of  the  region  immediately  preceding  complete  saturation  of  the 
system  provided  —  K.K 

AFX  \  1 

_  1— =  fraction  of  DNA  —  agent  complex  (1) 

AFsat/A 


=  fraction  of  free  agent 


(2) 


[DNA]! 


A  F, 


AFat. 


=  [free  agent] 


(3) 
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ABSTRACT 

A  summary  of  the  qualitative  and  quantitative  elements  of  a 
fluorescent  intercalator  displacement  (FID)  assay  useful  for  estab¬ 
lishing  the  DNA  binding  selectivity,  affinity,  stoichiometry,  and 
binding  site  size  and  distinguishing  modes  of  DNA  binding  is 
provided. 


Introduction 

The  regulation  of  gene  expression  is  based  on  the  se¬ 
quence-selective  recognition  of  nucleic  acids  by  repressor, 
activator,  and  enhancer  proteins.  A  full  understanding  of 
the  proteins  involved,  the  delineation  of  the  sequences 
to  which  they  bind,  and  the  discovery  of  the  genes  that 
they  regulate  holds  significant  promise  in  therapeutic 
medicine.1"6  Thus,  extensive  efforts  continue  to  be  direct¬ 
ed  at  understanding  the  transcriptional  process  and  are 
being  increasingly  directed  at  the  discovery  of  small  mol¬ 
ecules  that  selectively  bind  DNA  and  activate  (block  a 
repressor)  or  inhibit  (block  an  activator)  gene  expression.7"11 

Of  the  techniques  commonly  used  to  establish  the  DNA 
binding  properties  of  small  molecules  and  proteins,  most 
are  technically  challenging,  require  the  knowledge  of 
specialized  biochemical  procedures,  and  are  time  and 
labor  intensive.  The  most  widely  used  methods  are 
footprinting12  and  affinity  cleavage.13  Because  of  the  power 
of  the  technique,  a  number  of  such  methods  have  been 
introduced  including  DNase  I,14  exonuclease  III,15  MPE— 
Fe(II),16 1,10-phenanthroline— Cu(I),17  and  EDTA-Fe(II)18 
footprinting.  Complementary  approaches  for  disrupting 
binding  (interference  footprinting)19  by  specific  base  or 
phosphate  modifications  have  also  been  introduced.  Less 
general  techniques  that  capitalize  on  a  compound’s 
intrinsic  DNA  cleavage  (e.g.,  bleomycin,  endiynes),  alkyl¬ 
ation/thermal  cleavage20  (e.g.,  CC-1065,  duocarmycins), 
alkylation  (inhibition  of  in  vitro  transcription),  or  cross- 
linking2122  (e.g.,  mitomycin)  properties  have  been  applied 
to  selected  classes  of  molecules.  Recently,  techniques  for 
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expanding  the  sequence  coverage  of  footprinting23  and  the 
use  of  electrophoretic  mobility  shift  assays  (EMSA,  gel 
retardation  assay)24  and  DNase  I  footprinting25'26  for  the 
iterative  deconvolution  of  mixture  libraries  have  been 
disclosed,  expanding  their  applications.  Inherent  in  these 
methods  is  the  characterization  of  the  highest  affinity  sites 
within  a  size-limited  segment  of  DNA.  Similarly,  the  DNA 
binding  properties  of  proteins27  are  typically  assessed  by 
selection  screening,28"30  footprinting,14  or  EMSA.31  The 
former  provides  exhaustive  sequence  coverage  for  deduc¬ 
ing  the  preferred  site(s),  but  it  selects  only  the  highest 
affinity  sites  and  does  not  provide  quantitative  binding 
information.  Footprinting  and  EMSA31  have  been  used  to 
define,  or  at  least  refine,  a  protein’s  binding  selectivity, 
but  their  most  frequent  uses  have  been  to  provide 
qualitative  distinctions  and  quantitative  comparisons 
among  candidate  binding  sites  or  those  constructed  to 
■assess  single  base  pair  (bp)  substitutions. 

Herein,  we  review  a  complementary  technique,  a  fluor¬ 
escent  intercalator  displacement  (FID)  assay,32  33  for  es¬ 
tablishing  DNA  binding  affinity,  sequence  selectivity,  and 
binding  stoichiometry.  The  assay  is  nondestructive,  tech¬ 
nically  nondemanding,  and  amenable  to  high-throughput 
screening.  The  former  feature  would  permit  DNA  im¬ 
mobilization  onto  reusable  supports  for  repetitive  use  and 
expansion  of  the  sequence  space  beyond  that  presently 
exemplified  (all  5-bp  sites).  For  a  single  compound,  the 
technique  permits  establishment  of  a  rank  order  binding 
profile  for  all  possible  5-bp  sites,  comprehensively  defining 
the  sequence  selectivity  in  a  single  experiment.  For  a 
defined  sequence,  it  permits  the  high-throughput  iden¬ 
tification  of  binding  agents  from  a  library  of  compounds 
or  quantitative  titrations  for  establishment  of  binding 
constants.  The  assay  is  not  limited  to  small  molecule 
assessments  and  has  been  used  with  a  variety  of  ligands, 
including  proteins  and  triplex-forming  oligonucleotides. 

The  FID  Assay 

The  assay  utilizes  the  displacement  of  ethidium  bromide 
(or  thiazole  orange)3435  from  hairpin  deoxyoligonucle- 
otides  (Figure  1).  Hairpin  DNAs  are  treated  with  the 
intercalator,  yielding  a  fluorescence  increase  upon  bind¬ 
ing.  Addition  of  a  DNA  binding  compound  results  in  a 
decrease  in  fluorescence  due  to  displacement  of  the 
bound  intercalator,  where  the  percent  fluorescence  de¬ 
crease  is  directly  related  to  the  extent  of  binding.  For  a 
panel  of  DNA  sequences  displayed  in  a  96-well  format, 
the  resulting  profile  of  percent  fluorescence  decrease 
provides  relative  binding  affinities  and  comprehensively 
defines  a  compound’s  sequence  selectivity.  With  indi¬ 
vidual  sequences,  quantitative  titration  of  a  compound 
against  a  hairpin  prebound  with  ethidium  provides  reli¬ 
able  binding  constants  and  the  stoichiometry  of  binding. 
These  two  systems  may  be  used  in  a  complementary 
fashion,  one  intended  for  a  high-throughput  screen 
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E-mail:  boger@scripps.edu. 

VOL.  37,  NO.  1,  2004  /  ACCOUNTS  OF  CHEMICAL  RESEARCH  61 


Fluorescent  Intercalator  Displacement  Assay  for  DNA  Binding  Tse  and  Boger 


96-well  plate  with  each  well  containing 
one  individual  sequence  The  plate  may 
contain  library  of  all  possible  sequences  to 
screen  a  single  agent.  The  plate  may 
contain  one  (or  a  few)  select  sequence(s) 
to  screen  a  library  of  compounds. 


Addition  of  a  DNA 
binding  compound 


DNA  affinity  is  measured  as  a  decrease 
in  relative  fluorescence  indicating  binding 
with  displacement  of  ethidium  bromide 

Establish  rank  order  binding.  Define  DNA 
sequence  selectivity  of  a  given  compound 
or  select  compounds  with  affinity  for  a 
given  DNA  sequence. 

FIGURE  1.  The  FID  assay. 

capable  of  handling  large  compound  libraries  or  providing 
comprehensive  sequence  preference  data,  and  one  in¬ 
tended  for  quantitative  elucidation  of  DNA-ligand  inter¬ 
actions  and  establishing  binding  constants. 

Following  the  characterization  of  the  intrinsic  fluores¬ 
cent  increase  that  accompanies  DNA  intercalation,36  the 
displacement  of  DNA-bound  ethidium  bromide  has  been 
widely  used  to  establish  DNA  binding.37  Ethidium’s  non- 
selective,38  rapidly  equilibrating,  and  low  binding  affinity 
(~105  M'1)36  allows  assessment  of  compounds  with  low 
binding  affinity  or  assessment  of  low  affinity  sites,  aspects 
not  easily  addressed  by  other  techniques.  For  tight  binding 
sequences,  the  ethidium  displacement  proceeds  in  a 
virtually  noncompetitive  manner,  permitting  quantitative 
binding  assessments. 

Thiazole  orange34  35  is  an  effective  alternative  interca¬ 
lator  that  addresses  three  issues:  (1)  its  excitation  and 
emission  maxima  are  distinct  from  those  of  ethidium,  (2) 
its  fluorescent  enhancement  upon  intercalation  exceeds 
that  of  ethidium  (ca.  3000-fold  vs  20-fold),  and  (3)  it 
displays  less  sequence-dependent  DNA  binding,  albeit 
with  a  higher  affinity.39  This  permits  the  examination  of 
systems  where  there  would  be  ethidium  fluorescence 
interference  and  screening  at  lower  concentrations  or  with 
greater  sensitivities  than  achievable  with  ethidium,  and 
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5-CGXXXXXC  A  A  Variable  region:  easily  extendable 
3-GCXXXXXG  a  a  A  from  5  base  Pairs  (shown)  to  longer 
A  sequences 

X  =  A,  T,  G,  C 


named  5'-ATGCA 

5’-cg|atgca|c  a  A  , 

3'-GC  TACGT  G  A  aA 


named  5'-TGCAT 

5'-cg|tgcat|c  a  A  . 
3'-GC  ACGTAGa  aa 


■  Each  hairpin  DNA  contains  both  complementary  5'  -  3'  sequences 

■  Hairpins,  and  therefore  sequences,  are  equivalent  if  the  position 

of  the  variable  region  is  not  considered 


\ 

Thiazole  Orange 


FIGURE  2.  Structures  of  hairpin  deoxyoligonucleotides,  ethidium 
bromide,  and  thiazole  orange. 


with  less  sequence  variation  and  greater  consistency  in 
the  absolute  fluorescence  readings.  However,  the  binding 
constants  established  with  thiazole  orange  are  slightly 
lower  than  those  established  with  ethidium.32  39  This 
reflects  the  higher  association  constant  and  greater  com¬ 
petitive  binding  of  thiazole  orange,  reducing  the  measured 
apparent  binding  constant. 

Hairpin  deoxyoligonucleotides40  (Figure  2)  proved  es¬ 
pecially  useful  in  the  FID  assay.  Embedded  in  the  hairpin 
are  two  complementary  5'-to~3'  sequences,  connected  by 
a  loop,  avoiding  the  requirement  for  two  separate  strands 
and  the  associated  additional  quantitation  and  mixing. 
The  number  of  hairpins  required  to  create  a  library  of 
sequences  is  half  the  number  of  sequences.  For  example, 
512  hairpins  are  required  for  a  library  of  all  1024  possible 
5-bp  sequences  (Figure  2).  Moreover,  the  hairpins  were 
established  to  provide  stable  duplexes  at  working  tem¬ 
peratures  (25  °C),  independent  of  the  sequence.32-33  The 
variable  most  critical  to  the  success  of  the  assay,  and  most 
likely  to  be  responsible  for  avoidable  errors,  is  the  quality 
of  the  hairpins.  In  addition  to  the  concern  about  their 
constitution  and  purity,  their  concentration  is  critical  and 
may  be  determined  by  measuring  the  UV  absorption  (260 
nm)  of  the  denatured,  single-stranded  DNA  at  80-95  °C. 
Since  the  hairpins  exist  in  a  construct  representing  a 
combination  of  double-  (stem)  and  single-strand  (loop) 
DNA  at  25  °C,  calculations  based  on  the  UV  absorption 
at  25  °C,  utilizing  the  standard  coefficients  useful  for 
single-stranded  deoxyoligonucleotides,  underestimate  the 
concentration  by  as  much  as  25%. 41 


Binding  Constants  and  Stoichiometry 

Quantitative  displacement  of  ethidium  from  a  hairpin 
deoxyoligonucleotide  provides  a  well-defined  titration 
curve  that  is  useful  for  establishing  binding  constants  and 


Fluorescent  Intercalator  Displacement  Assay  for  DNA  Binding  Tse  and  Boger 


5'-CGAATTTC  A  A 

3’-GCTTAAAG  AA 

FID  Titration  Curve 


Scatchard  Plot 


AFluorescence 

FIGURE  3.  FID  titration  curve  of  netropsin  against  a  hairpin 
deoxyoligonucleotide  containing  the  sequence  5'-AATTT,  and  Scat¬ 
chard  plot  for  Kd  (-slope)  determination. 

the  stoichiometry  of  binding.  A  plot  of  the  change  in 
fluorescence  versus  equivalents  of  compound  provides  a 
titration  curve  from  which  the  stoichiometry  of  binding 
may  be  derived  as  the  intersection  of  the  pre-  and 
postsaturation  portions  of  the  curve  (Figure  3).  This 
method,  analogous  to  that  introduced  by  Bruice  with 
Hoechst  33258, 42  43  is  easily  extended  to  analyzing  higher 
order  2:1  and  3:1  complexes.  For  1:1  binding,  binding 
constants  are  established  by  Scatchard  analysis44  of  the 
equilibrium  portion  of  the  titration  curve,  generating  a  plot 
of  A F/ [free  agent]  versus  AF,  yielding  a  linear  section 
where  the  slope  provides  Ka.  Binding  constants  produced 
by  this  indirect  technique  involving  the  displacement  of 
ethidium  yielded  results  comparable  to  those  obtained 
directly  by  monitoring  the  fluorescent  increase  of  selected 
fluorescent  DNA  binding  compounds  (e.g.,  DAPI,  Table 
1). 

DNA  Binding  Sequence  Selectivity 

Minor  Groove  Binding  Compounds.  In  developmental 
stages  of  the  assay,  distamycin,  netropsin,  DAPI,  Hoechst 
33258,  and  berenil  were  examined  (Figure  4). 32  DAPI, 
Hoechst  33258,  and  berenil  are  fluorescent  dyes,  enabling 
a  direct  assessment  of  binding.  Notably,  the  fluorescence 
enhancement  characteristic  of  their  binding  did  not  inter- 


Table  1.  DAPI  Binding  Constants 

DNA  sequence 

7C (x  106  M-‘)a 

K  (x  106  M-1)* 

5'-AATTT 

110 

120 

5'-AATAA 

59 

87 

5'-ATTAA 

52 

77 

5'-AAAAA 

50 

65 

a  Scatchard  analysis  of  ethidium  displacement  titration.  b  Direct 
titration  using  fluorescence  enhancement  of  DAPI. 


FIGURE  4. 

fere  with  the  measurement  of  the  fluorescence  decrease 
derived  from  ethidium  displacement  (e.g.,  for  EB,  ex.  545 
nm,  em.  595  nm;  for  DAPI,  ex.  372  nm,  em.  454  nm). 

Each  compound  was  screened  against  a  library  of  512 
DNA  hairpins  (1.5  /<M)  containing  all  possible  5-bp  sites 
in  a  96-well  format,  enlisting  three  compound  concentra¬ 
tions  (1.0,  1.5,  and  2.0  n M)  in  duplicate  and  a  Gemini 
SpectraMax  plate  reader.32  On  this  scale  and  at  these 
concentrations,  the  cost  of  the  purchased  hairpins  is 
approximately  $100/assay.  This  provided  a  rank  order 
binding  profile  for  all  possible  5-bp  sites  that  is  repre¬ 
sented  as  a  merged  bar  graph  in  Figure  5  for  distamycin. 
In  addition  to  rapidly  providing  a  comprehensive  defini¬ 
tion  of  each  compound’s  sequence  selectivity,  the  com¬ 
parisons  provided  insights  not  previously  easily  recog¬ 
nized.  While  all  compounds  displayed  the  expected  A/T 
binding  selectivity,  netropsin  was  the  most  A/T  selective 
and,  by  some  accounts,  distamycin  was  the  least.  All 
exhibit  tight  binding  to  5-bp  >  4-bp  >  3-bp  A/T  sites 
(Figure  6).  This  preference  is  related  in  part  to  the 
conformational  characteristics  of  DNA,  where  longer  A/T 
sites  possess  a  narrower,  deeper  minor  groove  known  to 
contribute  to  the  selective  binding.  Interestingly,  dista¬ 
mycin  displayed  an  affinity  for  GC  bp  interrupted  5-bp 
A/T  sites  (2  x  2-bp)  that  exceeded  even  that  of  3-bp  A/T 
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Increasing  AT-conteot.  increasing  affinity  I 


FIGURE  5.  FID  assay  of  distamycin  A  against  a  library  of  512  hairpin 
deoxyoligonucleotides:  (A)  merged  bar  graph;  (B)  the  top  (highest 
affinity)  25  sequences. 


A/T  Containing  Hairpins 

FIGURE  6.  Median  rank  positions  of  hairpin  DNAs  containing  five, 
four,  three,  two,  and  two  x  two  contiguous  AT  bp  for  the  five 
compounds  surveyed. 


sites  and  was  only  slightly  weaker  that  that  of  4-bp  A/T 
sites.  Netropsin  exhibited  a  weaker  preference  for  such  2 
x  2-bp  A/T  sites,  reduced  from  its  affinity  to  4-bp  A/T 
sites.  In  constrast,  DAPI,  Hoechst  33258,  and  berenil 
exhibited  a  clear  preference  for  a  3-bp  A/T  site  over  a  2  x 
2-bp  A/T  site.  Combined,  this  was  suggested  to  reflect  the 
larger  5-bp  binding  site  requirement  for  distamycin  and 
its  unique  compensating  ability  to  bind  selected  GC  bp 
interrupted  5-bp  A/T  sites.  For  each  compound,  binding 
constants  were  determined  by  ethidium  displacement 
titrations  with  selected  sequences.  These  were  found  to 
be  comparable  to  those  established  by  footprinting  or 
calorimetry,  and  to  be  nearly  identical  (within  2-fold)  to 
those  obtained  by  directly  monitoring  the  fluorescence 
increase  of  DAPI  and  Hoechst  33258  upon  DNA  binding 
(see  Table  l).32 

A  similar  screen  of  netropsin  against  a  library  of  all 
possible  4-bp  sites  was  disclosed,  and  binding  constants 
for  all  10  possible  4-bp  A/T  sites  were  determined  by 
titrations  (Table  2). 39  With  binding  constants  for  all  10 
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Table  2.  Comparative  Titration  Binding  Constants 


titrant 

DNA  sequence 

7C(x  106  M-1) 

binding 

stoichiometry 

EB 

5'-AATT 

2.7 

2.92 

TO 

5'-AATT 

15 

3.05 

ATxlO6  M-‘) 

titrant 

DNA  sequence 

ethidium  bromide 

thiazole  orange 

netropsin 

5'-AAAT 

127 

113 

5'-AAAA 

92 

71 

5'-AATT 

65 

54 

5'-AATA 

64 

44 

5'-ATTA 

45 

35 

5'-ATAT 

41 

33 

5'-ATAA 

34 

18 

5'-TAAA 

26 

15 

5'-TTAA 

11 

8 

5'-TATA 

11 

8 

PyPyPy-y-PyPyPy-Dp,  1 
AcN-PyPyPy-y-PyPyPy-Dp,  2 


ImPyPy-y-PyPyPy-Dp,  3 
ImPyPy-y-PyPyPy-p-Dp,  4 


FIGURE  7. 


possible  A/T  sites,  an  explicit  rank  order  binding  was 
established  that  was  more  comprehensive  than  that 
provided  in  all  prior  combined  footprinting  studies. 
Highlighted  in  these  studies  was  the  fact  that  the  assay 
allows  the  rapid  identification  of  the  preferred  ensemble 
of  sequences  for  a  compound,  but  that  substantive 
conclusions  drawn  about  adjacent  sequences  in  the  96- 
well  screen  should  be  reserved  for  quantitative  titrations. 

Hairpin  Polyamides.  An  analysis  of  four  pyrrole  (Py)/ 
imidazole  (Im)  hairpin  polyamides  (Figure  7)  was  con¬ 
ducted  that  not  only  served  to  establish  the  utility  of  the 
assay  for  examining  minor  groove  binding  polyamides,  but 
also  underscored  its  comprehensive  capabilities.  The 
polyamides  were  chosen  to  examine  the  subtle  effects  of 
N-  and  C-terminal  functionalization  on  the  DNA  binding 
selectivity,  representing  a  challenging  test  of  the  assay 
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Table  3.  Sequence  Selectivity  of 
ImPyPy  y  PyPyPy  jS-Dp 


A.  FID  Assay  Analysis 


1.5  pM  agent 

2.0  |xM  agent 

Sequence" 

Avg 

Rank 

Avg 

Score 

Avg 

Rank 

Avg 

Score 

5-WGWWW 

55 

0.47 

54 

0.47 

5'-WWGWW 

82 

0.44 

36 

0.55 

5  '-WCWWW 

97 

0.32 

105 

0.28 

5'-WWWWW 

131 

0.27 

111 

0.27 

B.  Binding  Constants  by  FID  Titration 


Binding  Mode* 


Sequence  Ka  ( M  's 

1.  5  -TAACAAr  2.39  10s 


5'-CG  T  A  A  C  AiX  X 

(88^ 

3'-GC  ATTG 


A 

A 

A 


2.  5’-TAACAAC 

3.  5  -TAACACC 


1.73  x  lo8 
1.33  =<  I07 


6'-CG 


3'-Gd  X 


tACAATC A 

)OOh> 

30# 

TTGTTAG A 


5'-CQ 

3'-GC 


XX 


XX 


rGTTACA 

•OCh 

-OOCK 

tCAATG A 


a  4.  5'-GAACAAT  3.31  •  I07 

A 

A  5.  5'-GAACAAC  2.84  *  |07 

6.  5'-ATTGTTA  9.87*  1 07 
a  7.  5  -GTTGTTA  9.89x10 7 

A 

a  8.  5'-GGTGTTA  2.57  x|07 

9.  5-GTTGTTC  3.30  -  I07 


»W  =  A  or  T.  bO,  Py;  •,  Im.  "Lit."18  K  =  2.9  x  10s  M"1, 
footprinting. 


capabilities.  The  complete  binding  profiles  of  the  four 
molecules  to  all  possible  5-bp  sites  were  established  using 
the  FID  assay.45 

The  comparison  of  1  and  2  revealed  the  detrimental 
effect  of  the  /V- acetyl  substitution  with  2,4647  and  the 
hairpin  polyamide  ImPyPy-y-PyPyPy-Dp  (3),  like  1,  dis¬ 
played  a  straightforward  correlation  with  its  expected 
selectivity  for  5'-WGWWW.  For  ImPyPy-y-PyPyPy-/3-Dp 
(4),  the  ability  to  screen  complete  sequence  space  resulted 
in  the  discovery  of  an  unexpected  5'-WWCWW  sequence 
specificity  (Table  3A).  Although  detailed  studies  have 
probed  the  ligand— DNA  interactions  of  this  molecule  and 
its  sequence  selectivity  was  established  (5'-WGWWW),48 
its  explicit  binding  to  5'-WWCWW  had  not  been  described. 
Use  of  the  FID  assay  to  comprehensively  screen  DNA 
provided  a  complete  picture  of  its  selectivity.  Strikingly, 
the  quantity  and  quality  of  the  data  generated  by  the  assay 
also  provided  the  basis  for  a  binding  model.  This  model 
established  that  4  has  an  optimal  binding  site  of  6  bp  (not 
the  expected  5  bp)  of  the  form  5'-WWGWWW,  and  that 
the  /1/Dp  tail  combination  requires  two  (not  one)  degen¬ 
erate  A/T  base  pairs.  FID  titrations  confirmed  the  model 
by  providing  quantitative  binding  constants  (Table  3B). 

Also  disclosed  in  this  study  was  the  analysis  of  se¬ 
quence  sets  and  two  scoring  methods  for  their  comparison 
(Table  3A).  The  first  simply  calculates  the  average  rank 


order  position  of  the  constituent  members  of  a  sequence 
set,  whereas  the  second  calculates  an  average  %F  decrease 
of  the  sequence  set  relative  to  the  highest  affinity  se¬ 
quence.  This  latter,  less  obvious  scoring  procedure  may 
better  reflect  the  relative  importance  of  binding  to  a 
sequence  set. 

Selection  of  Bindino  Agents  for  a  Defined 
Sequence(s)  from  Compound  Libraries 

Most  of  the  current  screening  technologies  are  sufficiently 
technically  demanding  that  they  are  most  often  used  to 
investigate  a  few  individual  compounds.  A  true  attribute 
of  the  FID  assay  is  its  capabilities  for  screening  libraries 
of  compounds  (mixtures  or  individual).3349 

The  rapid  screening  of  individual  sequences  was  il¬ 
lustrated  with  two  hairpin  deoxyoligonucleotides  contain¬ 
ing  two  sequences  of  the  androgen  response  element 
(ARE),  the  14-bp  ARE-consensus,  and  PSA-ARE-3. 3349 The 
emergence  of  hormone-independent,  constituently  active 
androgen  receptor  dimer  is  responsible  for  prostrate 
cancer  relapse  resistant  to  chemotherapeutic  treatment. 
At  this  stage,  competitive  inhibition  of  the  androgen 
receptor  dimer  DNA  binding  has  therapeutic  potential  and 
could  arise  from  small  molecules  with  selective  affinity 
toward  the  ARE-consensus  and  PSA-ARE-3.  Libraries  of 
distamycin-like  compounds  were  prepared  using  1 1  het¬ 
eroaromatic  subunits  in  addition  to  the  N-methylpyrrole 
native  to  distamcyin  (Figure  8).  Libraries  were  produced 
such  that  the  first  two  subunits  (B  and  C)  were  fixed, 
producing  132  mixtures  of  10  compounds  with  variations 
at  the  last  position  (A).  Screening  the  library  against  the 
two  hairpins  using  the  FID  assay  revealed  that  the  mixture 
containing  the  pyrrole  subunit  at  both  the  second  (B)  and 
third  (C)  positions  gave  the  largest  decrease  in  fluores¬ 
cence  with  the  PSA-ARE-3  sequence  which  contains  a  5-bp 
A/T  site.  The  affinity  dropped  for  the  ARE-consensus 
containing  a  GC  bp  at  the  center  of  the  5 -bp  A/T  site  of 
this  sequence.  Deconvolution  by  screening  individual 
compounds  of  this  mixture  afforded  the  distamycin 
analogue  (5)  as  having  the  highest  affinity,  followed  closely 
by  an  analogue  containing  a  thiophene  subunit  at  the 
position  A  (Figure  9).  Both  compounds  exhibited  dimin¬ 
ished  affinity  for  the  ARE-consensus  sequence  resulting 
from  the  intervening  GC  bp.33  49 

Two  other  10-compound  mixtures  also  bound  the  PSA- 
ARE-3  sequence  effectively.  Deconvolution  to  individual 
compounds  identified  6  and  7  as  tight  binders  comparable 
to  5.  Notably,  6  showed  a  loss  of  affinity  for  the  ARE- 
consensus  analogous  to  5,  but  7  retained  equal  affinity, 
making  it  ideal  for  binding  both  the  PSA-ARE-3  and  ARE- 
consensus  sequences  (Figure  9).  Impressively,  7  exhibited 
potent  (IC50  =  8  nM)  and  selective  (ca.  40-fold)  inhibition 
of  androgen  receptor-mediated  gene  transcription  in  a 
cell-based  reporter  assay,  albeit  requiring  liposome  de¬ 
livery  of  the  compound  for  cell  penetration  and  observa¬ 
tion  of  activity. 
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H  1 10 

H^,N  1  subunits 

T 


Distamycin  A 


Solution  phase  combinatorial 
chemistry  using  different 
heterocyclic  amino  acids  and 
liquid-liquid  purification 
protocols 


First  generation  libraries  of 
potential  DNA  binding  agents 

Derivatize  with  a  basic  side 
chain 

Second  generation  libraries 
of  potential  DNA  binding 
agents  with  increased  affinity 

R  =  H,  R’  =  Me  R  =  H,  R'  =  Me  R  =  H,  R’ =  Me  R  =  H,  R'  =  Me 

R  =  C02fBu<  R'  =  H  R  =  COj'Bu,  R'  =  Na  R  =  CO^u,  R'  =  H  R  **  C02*Bu,  R’  =  H 


RHN 

C02R' 

R  =  H,  R'  =  Me  R  =  H,  R’  =  Me 

R  =  COj/Bu,  R’=  H  R  =  C02*Bu,  R’  =  H 


R  “  H,  R’  =  Et  R  =  H,R'  =  Et 

R  =  C02'Bu,  R*  =  H  R  =  C02'Bu,  R’  =  H 


r  =  H,  R’  =  Me  R  =  H,  R’  =  Me 

R  =  C02'Bu,  R‘  =  H  R  =  C02'Bu,  R’  =  H 


R  =  H,  R’  =  Et  r  =  h,  R'  =  Me 

R  =  C02f8u,  R’  -  H  R  =  C02fBu,  R'  =  Na 


FIGURE  8.  Solution-phase  strategy  for  libraries  and  the  subunits  used. 


PSA-ARE-3  ARE-consensus 

A  A  CCTTGTATAAC  ATAG-5'  A  A  CCTT  GT  CGTACAAG  A-5' 
Aa  A  GGAACATATTGTATC-3'  Aa  A  GGAACAGCAT GTTCT -3‘ 


5  Individual  Compounds  6  7 

FIGURE  9.  FID  assay  for  selecting  members  of  a  library  with  affinity 
for  the  ARE-consensus  and  PSA-ARE-3. 


Establishing  Subtle  Modes  of  Binding:  Hairpin 
versus  Extended  Binding  of  Polyamides 

The  linkage  of  polyamides  with  y-aminobutyric  acid  (y) 
provides  hairpin-bound  polyamides  that  mimic  the  2:1 
side-by-side  antiparallel  binding  of  unlinked  polyamides, 
enhances  the  binding  affinity  102-104-fold,  and  improves 
the  binding  selectivity.  In  constrast,  polyamides  incorpor¬ 
ating  a  one-carbon  shorter  head-to-tail  linker,  /3-alanine 
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(ft),  bind  in  an  extended  conformation,  forming  1:1  or 
side-by-side  antiparallel  2:1  complexes.50  A  variant  of  the 
FID  titrations  was  used  to  study  the  DNA  binding  proper¬ 
ties  of  a-substituted  ^-alanine-linked  polyamides  and  a 
series  of  novel  iminodiacetic  acid  (IDA) -linked  polyamides 
(Figure  10)  ,5152  In  these  two  series,  the  bound  conforma¬ 
tion  (hairpin  versus  extended  binding  and  parallel  versus 
antiparallel  binding)  could  be  established  by  analysis  of 
FID  titrations  of  hairpin  deoxyoligonucleotides  containing 
a  systematically  varied  A/T  binding  site  size.  Complemen¬ 
tary  assessments  using  a  combination  of  footprinting  and 
affinity  cleavage  techniques1214-18  are  technically  more 
demanding,  require  the  separate  preparation  of  the  linked 
Fe-EDTA  affinity  cleavage  derivatives,  and  do  not  as  easily 
distinguish  between  such  alternative  binding  modes. 

In  the  first  of  the  two  studies,  a  series  of  Py  polyamides 
were  linked  using  a-substituted  /S-alanines  to  probe  their 
impact  on  binding.51  Polyamide  binding  was  assessed  by 
using  a  series  of  hairpin  deoxyoligonucleotides  containing 
a  systematically  varied  length  of  the  A/T  binding  site  (5— 
12  bp).  The  (f?)-OMe-substituted  derivative  9  bound  with 
moderately  high  affinity,  whereas  the  parent  /3-alanine  8 
bound  with  intermediate  affinity  to  the  shortest  hairpin 
(Figure  10).  Very  little  change  in  the  behaviors  of  8  and  9 
was  seen  until  the  length  of  the  binding  site  reached  8 
bp,  where  8  dramatically  changed.  The  binding  constant 
increased  by  2  orders  of  magnitude,  and  the  stoichiometry 
of  binding  increased  to  2.  This  corresponds  to  the 
expected  behavior  of  extended  binding  over  a  8—9-bp  site 
as  an  antiparallel  2:1  side-by-side  dimer.53-55  In  contrast- 
,the  binding  stoichiometry  of  9  remained  constant  at  1:1, 
displaying  a  binding  constant  of  ~8  x  107  M-1  throughout 
the  range  of  5—10  bp  until  a  binding  site  length  of  11  bp 
was  reached,  where  a  second  binding  event  was  observed. 
The  behavior  seen  with  the  11-  and  12-bp  A/T  sites 
represents  two  sequential  binding  events  of  9  adopting  a 
hairpin  conformation,  each  requiring  nonoverlapping 
5-bp  A/T  sites  (Figure  10). 

The  studies  illustrated  that  a  generalizable  variant  of 
the  FID  titrations  may  be  utilized  to  distinguish  hairpin 
versus  extended  binding,  provide  information  on  the 
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Systematically  vary  the  5’-CGAAAAAC  A  A  5'-CGAAAAAAAAAAAAC  A  A 
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•  side-by-side  parallel  binding 


FIGURE  10.  Establishment  of  DNA  binding  mode  for  y-aminobutyric  acid-linked,  ^-alanine-linked,  and  iminodiacetic  acid-linked  polyamides. 
Hairpin  deoxyoligonucleotides  containing  varying  A/T  site  sizes  are  utilized  to  determine  binding  constants  and  stoichiometry. 


binding  site  size  and  stoichiometry,  and  establish  absolute 
affinity.  Its  use  resulted  in  the  discovery  that,  while  most 
substituents  on  a  /3-alanine  linker  disrupt  DNA  binding 
favoring  an  extended  binding  mode,  (K) -a-methoxy-/3- 
alanine  maintains  strong  binding  affinity  and  prefentially 
adopts  a  hairpin  versus  extended  binding  mode.51 

In  a  second  disclosure,  the  DNA  binding  properties  of 
iminodiacetic  acid-linked  polyamides  were  examined 
(Figure  10).  Utilizing  the  same  series  of  hairpin  deoxy¬ 
oligonucleotides  containing  a  systematically  varied  A/T 
binding  site  size,  the  binding  affinity,  stoichiometry,  and 
binding  site  size  were  determined.  Collectively  it  was 
determined  that  the  IDA  linker  has  a  unique  effect  on  the 
ability  of  the  polyamides  to  bind  DNA,  providing  what 
appears  to  be  the  first  well-characterized  example  of 
cooperative  parallel  extended  2:1  side-by-side  binding.52 

DNA  Binding  Properties  of  Proteins 

The  DNA  binding  affinity  and  selectivity  of  proteins27  are 
typically  assessed  by  selection  screening,28-30  footprinting 
and  affinity  cleavage,14  or  EMSA.31  Each  technique  pos¬ 
sesses  unique  strengths,  and  the  FID  assay  presents  a 
complementary,  technically  nondemanding  method  for 
qualitative  or  quantitative  assessment  of  DNA  bindingwith 
concurrent  establishment  of  the  stoichiometry  of  binding. 

Lymphoid  enhancer-binding  factor56  (LEF-1,  also  re¬ 
ferred  to  as  TCF-la5758)  and  closely  related  T-cell  factors 
(TCF-159)  are  cell  type-specific  DNA  binding  proteins  that 
play  important  regulatory  roles.60  61  The  majority  of  col¬ 
orectal  tumors  contain  mutations  that  result  in  accumula¬ 
tion  of  /3-catenin.  /3-Catenin  binds  to  and  activates  tran¬ 
scription  factors  including  LEF-1,  which  binds  the  DNA 
minor  groove  through  a  high-mobility-group  (HMG) 
domain,  recognizing  a  consensus  sequence  5'-CTTTGWW 
(W  =  A  or  T)  56-63  A  recent  NMR  structure  of  the  complex 
of  the  LEF-1  HMG  domain  and  a  15-bp  deoxyoligonucleo- 


tide64  revealed  that  the  protein  makes  extensive  and 
continuous  contacts  in  the  DNA  minor  groove,  encom¬ 
passing  the  entire  region  implicated  by  chemical  foot¬ 
printing  and  mutagenesis.56-63 

FID  titration  of  the  HMG  domain  of  LEF-1  against  a 
hairpin  deoxyoligonucleotide  containing  5'-CTTTGAAG 
provided  a  well-defined  titration  curve  (Figure  11).  Scat- 
chard  analysis  provided  a  Ka  of  1.5  x  109  M-1,  virtually 
identical  to  that  established  by  EMSA.63  Once  the  viability 
of  monitoring  protein-DNA  binding  with  the  FID  assay 
was  confirmed,  several  other  sequences  were  rapidly 
evaluated,  including  those  implicated  in  initial  studies,  but 
not  subsequently  examined  in  detail.  The  third  base  of 
the  consensus  sequence,  5'-CTTTGWW,  was  permuted  to 
each  base  variation  and  revealed  that  the  C,  G,  and  A 
substitutions  bind  the  LEF-1  HMG  domain  effectively  and 
almost  indistinguishably,  exhibiting  Ka’s  less  than  2-fold 
lower  than  that  of  the  reported  consensus  sequence  (Table 
4).  As  such,  the  consensus  sequence  of  5'-CTTTGWW  is 
accurate  but  could  easily  by  refined  to  5'-CTNTGWW  (N 
=  G,  C,  A  or  T)  to  reflect  this  nearly  indiscriminant  third 
site.65 

Although  not  demonstrated  in  this  work,  the  FID  assay 
could  easily  be  extended  to  the  examination  of  a  library 
of  proteins  or  mutants  against  such  hairpins. 

Summary  and  Outlook 

The  fluorescent  intercalator  displacement  (FID)  assay  is 
a  rapid,  high-resolution,  and  technically  nondemanding 
technique  for  establishing  DNA  binding  selectivity  and 
affinity  for  small  molecules,  proteins,  and  oligonucleo¬ 
tides.66  In  a  96-well  format,  the  assay  provides  for  the  high- 
throughput  evaluation  of  a  single  compound  against  a 
library  of  DNA  sequences  (establish  sequence  selectivity) 
or  for  the  high-throughput  selection  of  high-affinity  bind¬ 
ers  for  a  defined  sequence  from  a  library  of  compounds. 
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EB  bound  Ethidium 

to  dsDNA  bromide 


FIGURE  11.  Titration  of  the  HMG  domain  of  LEF-1  versus  a  hairpin 
DNA  containing  5'-CTTTGAA. 


Table  4.  HMG/LEF-1  Binding  Constants 


DNA  sequence 


Aa  (M-1) 


5'-CTTTGAAGa 

5'-CTTTGTTG 

5'-CCCCGAAG 

5'-CTCTGAAG 

5'-CTGTGAAG 

5'-CTATGAAG 


1.5  x  109 
1.0  x  109 
<1.0  x  10s 
9.4  x  108 
8.9  x  108 
8.3  x  108 


a  Lit.63  K<i  =  1.0  x  10"9  M,  EMSA. 


Alternatively,  FID  titrations  provide  detailed  information 
on  single  compounds  and  their  binding  to  individual 
sequences,  including  binding  constants,  stoichiometry  of 
binding,  and  binding  site  size.  This  latter  application,  as 
well  as  the  selection  screening  against  a  single  sequence, 
is  amenable  to  examination  of  any  sequence  length.  Its 
use  in  screening  against  a  complete  library  of  individual 
sequences  has  been  exemplified  with  sequence  sizes  of 
<5-bp  variable  regions  requiring  a  library  of  up  to  512 
hairpin  deoxyoligonucleotides.  Longer  sequences  require 
increasingly  larger  numbers  of  hairpins  (e.g.,  6  bp  =1024, 
7  bp  =  2048)  or  a  pooling  strategy  for  simple  laboratory 
implementation.  Unlike  complementary  techniques,  the 
FID  assay  is  nondestructive,  providing  the  opportunity  for 
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hairpin  immobilization  onto  reusable  supports  (chips, 
beads,  or  glass  slides),  thus  removing  the  barrier  to 
comprehensive  and  repeated  screening  of  sequences 
longer  than  that  presently  exemplified. 
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